CN112992370B - Unsupervised electronic medical record-based medical behavior compliance assessment method - Google Patents
Unsupervised electronic medical record-based medical behavior compliance assessment method Download PDFInfo
- Publication number
- CN112992370B CN112992370B CN202110489454.XA CN202110489454A CN112992370B CN 112992370 B CN112992370 B CN 112992370B CN 202110489454 A CN202110489454 A CN 202110489454A CN 112992370 B CN112992370 B CN 112992370B
- Authority
- CN
- China
- Prior art keywords
- medical
- data
- patient
- diagnosis
- treatment
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 61
- 238000003745 diagnosis Methods 0.000 claims abstract description 48
- 230000008569 process Effects 0.000 claims abstract description 37
- 230000005856 abnormality Effects 0.000 claims abstract description 10
- 230000000694 effects Effects 0.000 claims abstract description 8
- 238000011156 evaluation Methods 0.000 claims abstract description 7
- 238000005065 mining Methods 0.000 claims abstract description 5
- 238000003780 insertion Methods 0.000 claims description 18
- 230000037431 insertion Effects 0.000 claims description 18
- 238000004458 analytical method Methods 0.000 claims description 7
- 230000009471 action Effects 0.000 claims description 6
- 201000010099 disease Diseases 0.000 claims description 6
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 claims description 6
- 238000005516 engineering process Methods 0.000 claims description 5
- 238000004140 cleaning Methods 0.000 abstract description 4
- 238000007781 pre-processing Methods 0.000 abstract description 4
- 230000006399 behavior Effects 0.000 description 16
- 230000006870 function Effects 0.000 description 9
- 230000001965 increasing effect Effects 0.000 description 5
- 230000002159 abnormal effect Effects 0.000 description 4
- 238000003759 clinical diagnosis Methods 0.000 description 3
- 230000008901 benefit Effects 0.000 description 2
- 238000007418 data mining Methods 0.000 description 2
- 230000036541 health Effects 0.000 description 2
- 230000001939 inductive effect Effects 0.000 description 2
- 238000010801 machine learning Methods 0.000 description 2
- 238000012545 processing Methods 0.000 description 2
- 208000019553 vascular disease Diseases 0.000 description 2
- 206010020751 Hypersensitivity Diseases 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 208000026935 allergic disease Diseases 0.000 description 1
- 230000007815 allergy Effects 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 208000015181 infectious disease Diseases 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 230000005477 standard model Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H50/00—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
- G16H50/70—ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for mining of medical data, e.g. analysing previous cases of other patients
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/18—Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
- G06F18/232—Non-hierarchical techniques
- G06F18/2321—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
- G06F18/23213—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computing arrangements based on specific mathematical models
- G06N7/02—Computing arrangements based on specific mathematical models using fuzzy logic
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H10/00—ICT specially adapted for the handling or processing of patient-related medical or healthcare data
- G16H10/60—ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records
Landscapes
- Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Public Health (AREA)
- Pure & Applied Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Medical Informatics (AREA)
- Life Sciences & Earth Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Software Systems (AREA)
- Mathematical Physics (AREA)
- Computational Mathematics (AREA)
- Mathematical Analysis (AREA)
- Mathematical Optimization (AREA)
- Databases & Information Systems (AREA)
- Epidemiology (AREA)
- Algebra (AREA)
- Artificial Intelligence (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Evolutionary Computation (AREA)
- Biomedical Technology (AREA)
- Evolutionary Biology (AREA)
- Primary Health Care (AREA)
- Probability & Statistics with Applications (AREA)
- Operations Research (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Pathology (AREA)
- Automation & Control Theory (AREA)
- Fuzzy Systems (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Medical Treatment And Welfare Office Work (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Measuring And Recording Apparatus For Diagnosis (AREA)
Abstract
The invention discloses an unsupervised medical behavior compliance assessment method based on an electronic medical record, and the method comprises the following steps of S1, collecting, cleaning and preprocessing case data; s2, classifying the case data of the patient; s3, clustering the order data of the patients; s4, fusing the medical advice data after patient clustering and the operation data with time series, and mining a diagnosis and treatment process model according to the patient category and the effect of the patients after diagnosis and treatment; and S5, aligning the actual diagnosis and treatment sequence with the excavated diagnosis and treatment process model based on the cost function, positioning the position of the abnormality and calculating the deviation degree of the abnormality. The invention reduces the dependence of prior knowledge, can deeply utilize data, has strong clinical interpretability of an evaluation result, and has high preset logic complexity of the state of illness, physical condition and the like of a patient.
Description
Technical Field
The invention relates to the field of medical data processing and analysis, in particular to an unsupervised electronic medical record-based medical behavior compliance assessment method.
Background
In recent years, along with the increasing living standard of people, the development of the medical health industry also meets a plurality of problems. On one hand, medical expenses are increasing at a relatively fast speed, and in a clinical diagnosis and treatment process, due to interference of benefits of medical institutions, a phenomenon that clinical medical behaviors are unreasonable exists in some patients, so that medical resources are wasted, economic burden of the patients is increased, and even physical health of the patients can be possibly damaged. On the other hand, in the clinical diagnosis and treatment process, the problems of insufficient intervention flow and standard mastering of medical staff to the guideline requirements, insufficient compliance to the guideline requirements and the like exist, so that the phenomenon of non-compliance of medical behaviors is caused, the number of hospitalization days of a patient is increased, and the infection rate and the death rate of the patient are correspondingly increased.
With the advent of the big data era, a lot of valuable medical data are recorded in electronic medical records, but how to utilize artificial intelligence and informatization technology to enable the electronic medical record data to be better mined and utilized is a difficult problem which needs to be solved urgently. By establishing a set of medical behavior compliance assessment system based on the electronic medical records, the conventional electronic medical record data is mined and analyzed, technical assistance can be provided for medical workers, and the quality and efficiency of clinical diagnosis and treatment are greatly improved.
Most of traditional evaluation systems based on machine learning methods rely on prior knowledge of experts in related fields to label data, and machine learning methods based on behavior analysis learning have long learning time, but actually have relatively less labeled data, and data mining of electronic medical records is more suitable for semi-supervised or unsupervised data driving methods;
in the prior art, most of the prior art only utilizes the information of the patient charge item, and the information of the patient admission examination result, the allergy condition, the medical advice and the like is not considered comprehensively, so that the information utilization condition is not deep enough;
the prior art does not consider the clinical value of each index, and the evaluation result has insufficient clinical interpretability;
the existing model considers the uniformity of the model too much and depends on the difference of the illness state and the physical condition of different patients, so that the precision is low, the adaptability is poor, and meanwhile, the preset logic rule of the early warning system is more specific to a single disease type and a simple clinical scene.
Disclosure of Invention
The invention aims to provide an unsupervised medical behavior compliance assessment method based on an electronic medical record, which reduces the dependence of prior knowledge, can deeply utilize data, has strong clinical interpretability of assessment results and high preset logic complexity of patient conditions, physical conditions and the like.
In order to achieve the purpose, the invention is realized by adopting the following technical scheme:
the invention discloses an unsupervised electronic medical record-based medical behavior compliance assessment method, which comprises the following steps of:
s1, case data are collected, cleaned and preprocessed, the case data comprise personal information, admission data, medical history data, examination data, diagnosis and treatment results, medical operation data and hospitalization data of a patient, the medical operation data comprise medical order data and operation data, and the medical order data and the operation data are in a time series form;
s2, classifying the case data of the patient according to the personal information, the medical history data, the examination data, the diagnosis data and the diagnosis and treatment results of the patient, and constructing a fuzzy concept with similar index values;
s3, clustering the order data of the patients;
s4, integrating the medical advice data and the operation data after patient clustering, and mining a diagnosis and treatment process model according to the category of the fuzzy concept to which the patient belongs and the effect of the patient after diagnosis and treatment;
s5, customizing a cost function of the diagnosis and treatment process model, aligning the actual diagnosis and treatment sequence with the excavated diagnosis and treatment process model based on the cost function, positioning the position of the abnormality and calculating the deviation degree of the abnormality.
Preferably, in step S2, using fuzzy form concept analysis theory, each historical patient with a complete clinical path is regarded as an object of the fuzzy form background, each type of index is regarded as an attribute of the fuzzy form background, the value of the form background is normalized, and a threshold is set for each attribute and similar disease patients are merged, so as to simplify the fuzzy form background and construct fuzzy concepts, each fuzzy concept represents a specific patient group with similar index values.
Preferably, in step S3, the order data is first clustered by using a multi-granularity topic model, then the order data after topic clustering is clustered by using a K-means + + algorithm to cluster the order data after topic clustering by day, so as to reduce the difficulty of medical behavior compliance assessment,
if the number of subjects in the order data is t, the similarity between the patient i and the patient j on the m-th day and the n-th day is described as follows:
Disi,m=(pim1k1,pim2k2,…,pimtkt) (2)
wherein D represents the order data, Disi,mRepresenting the probability distribution of the subject over a total of t dimensions on day m for patient i, p representing the probability of the subject, k representing the weight of the corresponding subject, S (D)i,m,Dj,n) Represents the similarity between patient i and patient j on day m and day n, pimtktA topic probability distribution representing the patient i day m dimension t topic vector.
Preferably, in step S4, the subdivided "cured" or "improved" patient data is mined using the Imf process discovery algorithm in the ProM process mining software.
Preferably, in step S5, on the premise of the frequency of the specific medical action, the cost function of the medical procedure model based on the TF-IDF weighting technology is used to quantify the cost of the inserted or skipped medical action, and the alignment between the actual medical procedure sequence and the standard medical procedure model is realized through the pnetpply plug-in the ProM procedure mining software, so as to determine the position and deviation degree of the abnormality.
Preferably, in step S5, the insertion cost Cos t (x) at the time of the insertion event x of the medical sequence Seq is specifically described as follows:
wherein N (Seq) is the total number of occurrences of the treatment sequence Seq, N (Seq)xThe number of occurrences of the medical sequence Seq including the insertion event x, N is the number of samples of all medical sequences, N (x) is the number of medical sequences including the insertion event x,
TF-IDF (Term Frequency-Inverse Document Frequency) is a commonly used weighting technique for information retrieval and data mining, where TF is Term Frequency (Term Frequency) in equation (3) and IDF is Inverse text Frequency index (Inverse Document Frequency) in equation (4).
The invention has the beneficial effects that:
1. the invention provides a mining scheme integrating more comprehensive data, and more comprehensively considers information of various aspects of patients.
2. The invention introduces the fuzzy form concept analysis theory to subdivide the patient population, so that the range of the reference data of the process model is finer, and the adaptation degree of the standard process model and the actual diagnosis and treatment sequence is improved.
3. The Multi-granularity Topic Model clustering method (M-GTM, Multi-gain Topic Model) adopted by the invention is obviously superior to the common LDA Topic Model clustering in use effect.
4. The method of the invention introduces TF-IDF weighting technology when calculating the frequency of medical behaviors, thereby improving the accuracy of cost function calculation.
5. The medical behavior compliance evaluation flow based on the electronic medical record does not need to label abnormal medical data, and is an unsupervised data driving method.
Drawings
FIG. 1 is a schematic flow chart of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is further described in detail below with reference to the accompanying drawings.
As shown in fig. 1, the present invention comprises the steps of:
s1: collecting, cleaning and preprocessing case data, wherein the case data comprises admission data, hospitalization data, medical history data, various examination data and medical advice data of a patient;
s2: classifying the patients according to the different conditions of the personal information, various examination data, the patient and family medical history and diagnosis data of the patients and the effect after diagnosis and treatment of the case data;
s3: clustering medical advice data of a patient, wherein the medical advice data is in a time series form;
s4: medical advice data after patient clustering and medical operation data with time series such as operations are fused, the effect of the patients after diagnosis and treatment is comprehensively considered according to the subdivided patient categories, and a relatively more effective diagnosis and treatment process model is excavated;
s5: and customizing a cost function of the diagnosis and treatment process model, aligning the actual diagnosis and treatment sequence with the excavated diagnosis and treatment process model based on the cost function, positioning the position of the abnormality and calculating the deviation degree of the abnormality.
In step S2, a fuzzy form concept analysis theory is introduced, each patient with a history of complete clinical paths is regarded as an object of the fuzzy form background, and each type of index is regarded as an attribute of the fuzzy form background. And then, carrying out normalization processing on the values of the form background, setting a threshold value for each attribute and combining similar disease patients so as to simplify the fuzzy form background. Construction of a grid of fuzzy concepts may then be performed, each fuzzy concept representing a particular patient population with similar metric values.
In step S3, the medical order data is first clustered by using a Multi-granularity Topic Model (Multi-gain Topic Model), and then the medical order data after Topic clustering is clustered by day by using a K-means + + algorithm, so as to reduce the difficulty of medical behavior compliance evaluation. If the number of subjects in the order data is t, the similarity between the patient i and the patient j on the m-th day and the n-th day is described as follows:
Disi,m=(pim1k1,Pim2k2,…,Pimtkt) (2)
therein, Disi,m=(pim1k1,pim2k2,…,pimtkt) Represents the probability distribution of the subject over a total of t dimensions on the m-th day for patient i, p represents the probability of the subject, and k represents the weight of the corresponding subject.
In step S4, a Imf (Inductive Miner-frequency) process discovery algorithm in the ProM process mining software is used to mine the diagnosis and treatment process model for the subdivided "cured" or "improved" patient data.
In step S5, on the premise of how frequently the specific medical action is performed, the cost of the medical action in the form of insertion or skipping is quantified by using the cost function of the diagnosis and treatment process model based on the TF-IDF weighting technique. Taking the insertion event x of a certain medical sequence Seq as an example, the insertion cost (x) is specifically described as follows:
where N (Seq) is the total number of occurrences of the medical sequence Seq, N (Seq) x is the number of occurrences of the medical sequence Seq including the insertion event x, N is the number of samples of all medical sequences, and N (x) is the number of medical sequences including the insertion event x.
And then, aligning the actual diagnosis and treatment sequence with the standard diagnosis and treatment process model through a PNetRelyer plug-in the ProM process mining software, and finally judging the abnormal position and the deviation degree.
In practical use, taking a large vessel disease as an example, the implementation process is as follows:
1. collecting, cleaning and preprocessing an electronic medical record of a patient with a large vascular disease:
collecting and sorting the electronic medical record data of all patients with the major vascular disease, and selecting admission data, hospitalization data, medical history data, various examination data and medical advice data of the patients in the electronic medical records.
Then data cleaning and preprocessing are carried out, including unifying the naming of similar items or diagnosis and treatment operations, eliminating the clinical path of midway insertion or exit and cases of invalid treatment, merging the same diagnosis and treatment operations or medical orders at the same time and the like.
2. Patient type classification:
(1) according to the fuzzy form concept analysis theory, firstly, a fuzzy form background is constructed: the method comprises the steps of sorting different conditions of personal information, various examination data, patient and family medical history and diagnosis data of a patient and information of several dimensionalities of the effect after diagnosis and treatment, and setting the attribute of a fuzzy form background according to the information.
(2) The specific information of each patient is converted into the membership degree corresponding to each attribute, and the membership degrees of all the attributes need to be normalized.
(3) The fuzzy form background is reduced by setting appropriate thresholds for the degree of membership of each attribute and merging similar patients. The membership degree of the attribute can be reasonably divided by means of expert experience, or the change condition of the membership degree of the attribute can be fitted into a normal distribution according to historical data, a proper confidence interval (for example, a confidence interval of 80% is set, and a single-side or double-side confidence interval is set) is selected, the membership degree outside the confidence interval is set to be 0, and the membership degree in the confidence interval is set to be 1.
(4) And constructing a concept lattice. The granularity of patient classification may be determined by selecting a hierarchy of concept lattices.
3. Doctor advice data clustering module:
for similar patients, the diagnosis and treatment schemes of each day are similar, so that firstly, medical order data are clustered by adopting a Multi-granularity Topic Model M-GTM (Multi-gain Topic Model), if the Topic clustering precision is improved, a little prior knowledge of medical order data classification can be added, then, the medical order data after Topic clustering are clustered by adopting a K-means + + algorithm, and the medical order data after Topic clustering are clustered by day according to the similarity, so that the difficulty of medical behavior compliance evaluation is reduced. If the number of subjects in the order data is t, the similarity between the patient i and the patient j on the m-th day and the n-th day is described as follows:
Disi,m=(pim1k1,Pim2k2,…,Pimtkt) (2)
therein, Disi,m=(pim1k1,pim2k2,…,pimtkt) Represents the probability distribution of the subject over a total of t dimensions on the m-th day for patient i, p represents the probability of the subject, and k represents the weight of the corresponding subject.
4. Excavating a diagnosis and treatment process model:
medical order data after patient clustering and medical operation data with time series such as operations are fused, the effect of the patients after diagnosis and treatment is comprehensively considered according to subdivided patient categories, Imf (Inductive Miner-frequency) process discovery algorithm is selected through Prom process mining software, the patient category data of 'cured' and 'improved' are selected for mining of diagnosis and treatment process models, and relatively more effective diagnosis and treatment process models are mined.
5. And (3) discovering the abnormal medical behavior:
(1) on the premise of the frequency of specific medical behaviors related to the large vessel diseases, the cost of the medical behaviors in the forms of insertion or skipping and the like is quantified by adopting a diagnosis and treatment process model cost function based on the TF-IDF weighting technology. Taking the insertion event x of a certain medical sequence Seq as an example, the insertion cost (x) is specifically described as follows:
where N (Seq) is the total number of occurrences of the medical sequence Seq, N (Seq) x is the number of occurrences of the medical sequence Seq including the insertion event x, N is the number of samples of all medical sequences, and N (x) is the number of medical sequences including the insertion event x.
(2) And aligning the actual diagnosis and treatment sequence with the excavated diagnosis and treatment process standard model based on the cost function through a PNetRelyer plug-in the ProM software, and finally judging the abnormal position and the deviation degree.
There are, of course, many other embodiments of the invention and modifications and variations which will be apparent to those skilled in the art without departing from the spirit and scope of the invention as defined in the appended claims.
Claims (6)
1. An unsupervised electronic medical record-based medical behavior compliance assessment method is characterized by comprising the following steps of:
s1, case data are collected, cleaned and preprocessed, the case data comprise personal information, admission data, medical history data, examination data, diagnosis and treatment results, medical operation data and hospitalization data of a patient, the medical operation data comprise medical order data and operation data, and the medical order data and the operation data are in a time series form;
s2, classifying the case data of the patient according to the personal information, the medical history data, the examination data, the diagnosis data and the diagnosis and treatment results of the patient, and constructing a fuzzy concept with similar index values;
s3, clustering the order data of the patients;
s4, integrating the medical advice data and the operation data after patient clustering, and mining a diagnosis and treatment process model according to the category of the fuzzy concept to which the patient belongs and the effect of the patient after diagnosis and treatment;
s5, customizing a cost function of the diagnosis and treatment process model, aligning the actual diagnosis and treatment sequence with the excavated diagnosis and treatment process model based on the cost function, positioning the position of the abnormality and calculating the deviation degree of the abnormality.
2. The unsupervised electronic medical record-based medical behavior compliance assessment method according to claim 1, wherein: in step S2, using the fuzzy form concept analysis theory, each patient with a history of complete clinical paths is regarded as an object of the fuzzy form background, each type of index is regarded as an attribute of the fuzzy form background, the value of the form background is normalized, a threshold is set for each attribute and similar disease patients are merged, so as to simplify the fuzzy form background and construct fuzzy concepts, each fuzzy concept represents a specific patient group with similar index values.
3. The unsupervised electronic medical record-based medical behavior compliance assessment method according to claim 1, wherein: in step S3, firstly, the medical advice data is clustered by adopting a multi-granularity topic model, then the medical advice data after topic clustering is clustered by adopting a K-means + + algorithm according to the day to reduce the difficulty of the medical behavior compliance evaluation,
if the number of subjects in the order data is t, the similarity between the patient i and the patient j on the m-th day and the n-th day is described as follows:
Disi,m=(pim1k1,pim2k2,…,pimtkt) (2)
wherein D represents the order data, Disi,mTopic probability distribution representing a topic vector for patient i over a total of t dimensions on day m, p representing topic probability, k representing weight of the corresponding topic, S (D)i,m,Dj,n) Represents the similarity between patient i and patient j on day m and day n, pimtktA topic probability distribution representing the patient i day m dimension t topic vector.
4. The unsupervised electronic medical record-based medical behavior compliance assessment method according to claim 1, wherein: in step S4, a Imf process discovery algorithm in the ProM process mining software is used to mine the diagnosis and treatment process model for the subdivided "cured" or "improved" patient data.
5. The unsupervised electronic medical record-based medical behavior compliance assessment method according to claim 1, wherein: in step S5, on the premise of the frequency of the specific medical action, the cost function of the diagnosis and treatment process model based on the TF-IDF weighting technology is used to quantify the cost of the inserted or skipped medical action, and the alignment between the actual diagnosis and treatment sequence and the standard diagnosis and treatment process model is realized by the pnetpply plug-in the ProM process mining software, so as to determine the position and deviation degree of the abnormality.
6. The unsupervised electronic medical record-based medical behavior compliance assessment method according to claim 5, wherein: in step S5, when inserting event x of medical sequence Seq, the insertion cost Cos t (x) is specifically described as follows:
wherein N (Seq) is the total number of occurrences of the treatment sequence Seq, N (Seq)xThe number of occurrences of the medical sequence Seq including the insertion event x, N is the number of samples of all medical sequences, and N (x) is the number of medical sequences including the insertion event x.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110489454.XA CN112992370B (en) | 2021-05-06 | 2021-05-06 | Unsupervised electronic medical record-based medical behavior compliance assessment method |
PCT/CN2021/132173 WO2022233121A1 (en) | 2021-05-06 | 2021-11-22 | Unsupervised medical behavior compliance assessment method based on electronic medical record |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110489454.XA CN112992370B (en) | 2021-05-06 | 2021-05-06 | Unsupervised electronic medical record-based medical behavior compliance assessment method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112992370A CN112992370A (en) | 2021-06-18 |
CN112992370B true CN112992370B (en) | 2021-07-30 |
Family
ID=76336976
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110489454.XA Active CN112992370B (en) | 2021-05-06 | 2021-05-06 | Unsupervised electronic medical record-based medical behavior compliance assessment method |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN112992370B (en) |
WO (1) | WO2022233121A1 (en) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112992370B (en) * | 2021-05-06 | 2021-07-30 | 四川大学华西医院 | Unsupervised electronic medical record-based medical behavior compliance assessment method |
CN113553399B (en) * | 2021-07-16 | 2022-05-27 | 山东建筑大学 | Text search method and system based on fuzzy language approximate concept lattice |
CN115083616B (en) * | 2022-08-16 | 2022-11-08 | 之江实验室 | Chronic nephropathy subtype mining system based on self-supervision graph clustering |
CN115910387A (en) * | 2022-11-08 | 2023-04-04 | 北京健康在线技术开发有限公司 | Data processing method, device and equipment based on time sequence and storage medium |
CN116453637B (en) * | 2023-03-20 | 2023-11-07 | 杭州市卫生健康事业发展中心 | Health data management method and system based on regional big data |
CN118522396B (en) * | 2024-07-23 | 2024-10-01 | 济南科汛智能科技有限公司 | Clinical diagnosis and treatment data input method and system |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106202477A (en) * | 2016-07-18 | 2016-12-07 | 北京千安哲信息技术有限公司 | Medical expense method for digging and device |
CN106951719A (en) * | 2017-04-10 | 2017-07-14 | 荣科科技股份有限公司 | The construction method and constructing system of clinical diagnosis model, clinical diagnosing system |
CN110335684A (en) * | 2019-06-14 | 2019-10-15 | 电子科技大学 | The intelligent dialectical aid decision-making method of Chinese medicine based on topic model technology |
CN111696640A (en) * | 2020-06-12 | 2020-09-22 | 上海联影医疗科技有限公司 | Method, device and storage medium for automatically acquiring medical record template |
CN111832298A (en) * | 2020-07-14 | 2020-10-27 | 北京百度网讯科技有限公司 | Quality inspection method, device and equipment for medical records and storage medium |
CN112735597A (en) * | 2020-12-31 | 2021-04-30 | 荆门汇易佳信息科技有限公司 | Medical text disorder identification method driven by semi-supervised self-learning |
Family Cites Families (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150046181A1 (en) * | 2014-02-14 | 2015-02-12 | Brighterion, Inc. | Healthcare fraud protection and management |
US9779407B2 (en) * | 2014-08-08 | 2017-10-03 | Brighterion, Inc. | Healthcare fraud preemption |
US11521717B2 (en) * | 2014-02-21 | 2022-12-06 | Intelligent Medical Objects, Inc. | System and method for generating and updating a user interface to evaluate an electronic medical record |
CN104915560A (en) * | 2015-06-11 | 2015-09-16 | 万达信息股份有限公司 | Method for disease diagnosis and treatment scheme based on generalized neural network clustering |
US10659225B2 (en) * | 2017-06-30 | 2020-05-19 | Microsoft Technology Licensing, Llc | Encrypting existing live unencrypted data using age-based garbage collection |
US10886013B1 (en) * | 2017-11-15 | 2021-01-05 | Iodine Software, LLC | Systems and methods for detecting documentation drop-offs in clinical documentation |
CN109243567B (en) * | 2018-08-14 | 2021-11-02 | 山东科技大学 | Medicine recommendation method based on prescription data mining |
CN111667927A (en) * | 2020-06-05 | 2020-09-15 | 山东凯鑫宏业生物科技有限公司 | ZigBee network intelligent medical system and acquisition node networking method thereof |
CN111916191B (en) * | 2020-07-22 | 2024-07-02 | 复旦大学 | Medical behavior operation compliance assessment system based on medical behavior data |
CN112992370B (en) * | 2021-05-06 | 2021-07-30 | 四川大学华西医院 | Unsupervised electronic medical record-based medical behavior compliance assessment method |
-
2021
- 2021-05-06 CN CN202110489454.XA patent/CN112992370B/en active Active
- 2021-11-22 WO PCT/CN2021/132173 patent/WO2022233121A1/en active Application Filing
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106202477A (en) * | 2016-07-18 | 2016-12-07 | 北京千安哲信息技术有限公司 | Medical expense method for digging and device |
CN106951719A (en) * | 2017-04-10 | 2017-07-14 | 荣科科技股份有限公司 | The construction method and constructing system of clinical diagnosis model, clinical diagnosing system |
CN110335684A (en) * | 2019-06-14 | 2019-10-15 | 电子科技大学 | The intelligent dialectical aid decision-making method of Chinese medicine based on topic model technology |
CN111696640A (en) * | 2020-06-12 | 2020-09-22 | 上海联影医疗科技有限公司 | Method, device and storage medium for automatically acquiring medical record template |
CN111832298A (en) * | 2020-07-14 | 2020-10-27 | 北京百度网讯科技有限公司 | Quality inspection method, device and equipment for medical records and storage medium |
CN112735597A (en) * | 2020-12-31 | 2021-04-30 | 荆门汇易佳信息科技有限公司 | Medical text disorder identification method driven by semi-supervised self-learning |
Non-Patent Citations (2)
Title |
---|
Semi-Supervised Patient Similarity Clustering Algorithm Based on Electronic Medical Records;Jiao Zhang等;《IEEE Access》;20190724;第7卷;第90705-90714页 * |
电子病历命名实体识别和实体关系抽取研究综述;杨锦锋 等;《自动化学报》;20140831;第40卷(第8期);第1537-1562页 * |
Also Published As
Publication number | Publication date |
---|---|
CN112992370A (en) | 2021-06-18 |
WO2022233121A1 (en) | 2022-11-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN112992370B (en) | Unsupervised electronic medical record-based medical behavior compliance assessment method | |
Kushwaha et al. | Machine learning algorithm in healthcare system: A Review | |
Shailaja et al. | Machine learning in healthcare: A review | |
Chen et al. | Semi-supervised learning via regularized boosting working on multiple semi-supervised assumptions | |
Meyfroidt et al. | Machine learning techniques to examine large patient databases | |
CN113688248A (en) | Medical event identification method and system under condition of small sample weak labeling | |
Jader et al. | Fast and Accurate Artificial Neural Network Model for Diabetes Recogni-tion | |
CN110890146A (en) | Bedside intelligent interaction system for intelligent ward | |
Peng et al. | BiteNet: bidirectional temporal encoder network to predict medical outcomes | |
Abdul-Jabbar | Data analytics and techniques | |
Qian et al. | Incomplete label distribution feature selection based on neighborhood-tolerance discrimination index | |
Karmani et al. | A review of machine learning for healthcare informatics specifically tuberculosis disease diagnostics | |
Alves et al. | Variational autoencoders for medical image retrieval | |
Kumar et al. | Disease prediction using machine learning algorithms KNN and CNN | |
Chakraborty et al. | Covid-19 and diabetes risk prediction for diabetic patient using advance machine learning techniques and fuzzy inference system | |
Kumar et al. | Special disease prediction system using machine learning | |
Cheng et al. | Combining knowledge extension with convolution neural network for diabetes prediction | |
Hughes et al. | Prediction-constrained topic models for antidepressant recommendation | |
Batal et al. | Temporal Data Mining for Healthcare Data. | |
Jiyun et al. | Patient similarity measuring with graph embedded learning and triplet network | |
Karmani et al. | Taxonomy on Healthcare System Based on Machine Learning Approaches: Tuberculosis Disease Diagnosis. | |
Awari | Diseases prediction model using machine learning technique | |
CN118538399B (en) | Intelligent pediatric disease diagnosis auxiliary system | |
Sapna et al. | Integration of Fuzzy Clustering Technique with Big Data for Disease Diagnosis | |
Patidar et al. | An efficient SVM and ACO-RF method for the cluster-based feature selection and classification |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |