[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN107705839A - Disease automatic coding and system - Google Patents

Disease automatic coding and system Download PDF

Info

Publication number
CN107705839A
CN107705839A CN201711013515.5A CN201711013515A CN107705839A CN 107705839 A CN107705839 A CN 107705839A CN 201711013515 A CN201711013515 A CN 201711013515A CN 107705839 A CN107705839 A CN 107705839A
Authority
CN
China
Prior art keywords
result
disease
matching
keyword
concept
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201711013515.5A
Other languages
Chinese (zh)
Other versions
CN107705839B (en
Inventor
吴军
宋伟
高希余
桑波
巩玉强
樊昭磊
张琪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhongyang Health Technology Group Co ltd
Original Assignee
Shandong Yang Yang Software Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shandong Yang Yang Software Co Ltd filed Critical Shandong Yang Yang Software Co Ltd
Priority to CN201711013515.5A priority Critical patent/CN107705839B/en
Publication of CN107705839A publication Critical patent/CN107705839A/en
Application granted granted Critical
Publication of CN107705839B publication Critical patent/CN107705839B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Medical Treatment And Welfare Office Work (AREA)

Abstract

The invention discloses disease automatic coding and system, comprise the following steps:Receive input data:Input data includes:Raw diagnostic data and patient file data;Raw diagnostic data and patient file data to input pre-process;Retrieved in criteria classification coding ICD 10, judge whether to obtain result, if it is not, then carrying out word segmentation processing, association's conversion processing and search matching tree processing to raw diagnostic data, then filter out optimal result from the result of matching tree;If result be not it is optimal if result split, word segmentation processing, association's conversion processing and search matching tree processing are carried out to each individually disease name, then optimal result is filtered out from the result of matching tree, or go out optimal result from patient file data screening, the coding accuracy of assessment result, exports coding result and accuracy evaluation result.Efficiently solve the standard diagnostics that each medical institutions use and encode skimble-scamble problem.

Description

Disease automatic coding and system
Technical field
The present invention relates to medical information technical field and field of artificial intelligence, more particularly to a kind of disease to compile automatically Code method and system.
Background technology
With country's deepening continuously and deepen to the layout of public medical and medical reform, medical services are comprehensive to be carried Rise and urgent requirement is proposed to medical information.In medical insurance, public health service, hospital informatioization management etc., medical treatment Informationization is from the process automation management development of completion overall process record to internet+health medical treatment;In present big data and people Under work intelligence background, medical information progressively touches the core business of medical treatment, provides diagnosis and treatment process aid decision and instructs to join Examine.In medical profession and medical teaching, academic exchange, data analysis or even medical artificial intelligence's aid decision, standardization Information (diagnosis, case history, check, medicine etc.) record and exchange to be the essential even most important thing, wherein it is primary just It is the standardization of diagnosis:Standard diagnostics encode (ICD).
(1) the non-constant disunity of understanding and grasping of the coder for standard code, causes coding result to have differences.
Diagnosis in first page of illness case reports, before medical insurance reimbursement in case history filing, medical record, by case of hospital room coder according to According to《The international statistical classification of diseases and related health problems》(being commonly called as handbook), and defend the standard diagnostics coded word of planning commission's issue Allusion quotation, the diagnosis on first page of illness case are encoded.What this work was manually performed, therefore manpower is limited by, encoding human Member's quality, the deciphering to standard and the assurance degree to the huge content of encoder dictionary, and length of service and experience, so as to daily Manually to the limited amount of code, and erroneous judgement be present, and encode fineness and criterion also varies with each individual, not With hospital data contrast with exchange, these problems will be highlighted and are exaggerated, in addition between hospital of same institute different coding person, Same coder's different times, encode that there is also difference.
(2) the actual diagnosis of personalized, the self-defined input of doctor is not exclusively compatible with standard diagnostics code database.
Because the diagnosis actually used in doctor's routine work can not be completely covered in the diagnosis included in diagnosis coding storehouse, and And for same diagnosis, due to the particularity of Chinese, in the extreme diversity and industry of same the concept of diseases form of presentation simultaneously In the absence of unified standard this specification of medical terms, the title literary style and structure of the diagnosis that different doctors uses also phase not to the utmost Together, multifarious, the alias used has very big difference;Renewal plus standard diagnostics dictionary takes long enough (national standard ICD10, last time update 2009 apart from this renewal just 8 years as long as in the past 2017), it can not cover in time most emerging Disease and its title;And the clinical practice requirement of doctor is not reached for the parting of particular diagnosis, the level of detail of standard code; Or doctor write diagnose when, can add some additional detailed informations when writing and diagnosing, and standard diagnostics code database The difference classification brought for extraneous information can not embody;Furthermore doctor also can use some only when writing diagnosis Have it is known in the industry write a Chinese character in simplified form referred to as or abbreviation, and these are nor standard diagnostics coding place is included.Essentially, Doctor is the detailed record disease information from diagnosis and treatment operational angle, rather than for sorting code number, so incompatible feelings Condition is inveteracy inevitable, and above-mentioned a variety of causes result in most doctors in hospital's practical business and fill in diagnosis When using procedure selection or hand-written coding many mistakes all be present.
(3) standard diagnostics code database version disunity.
Mainly in national standard diagnosis coding storehouse, 09 year version combines respectively the code database that current hospital each on the market uses The demand modifications and extensions of individual hospital oneself, which are spread out, to be stretched, and the part of modifications and extensions does not have special organisations and institutions' unification Management and distribution, the version between hospital and hospital can not be completely compatible, in addition have hospital use earlier country mark Quasi- version change spreads out what is stretched, even if so causing same disease, the coding used in Different hospital cannot guarantee that Complete unity, and each hospital part stretched of being spread out by national standard version just less can guarantee that unanimously.
(4) in the accuracy of Medical Record Statistics data, medical treatment, teaching, the accuracy of scientific research data retrieval, disease packet DRGS Medical insurance pre-payment (DRGS-PPS) sound development etc., diagnosis coding correct and that standard is unified is the base of everything Plinth.
(5) it is that coding is not accurate enough the defects of prior art, for the middle disease for carrying conjunction, prior art does not have Process of refinement, this problem is not often considered, cause disease code result fineness inadequate.
Whether it is still basic herein in population health information platform, the health medical treatment data standard of the unified authority of structure On improve all kinds of basic businesses of population health information using upper, and after being built in medical information platform, medical information warehouse High-level application, including but not limited to medical dynamic monitoring, disease control, medical research, medical artificial intelligence, aid decision etc., Diagnostic message cross-platform exchange and is required to keep stable and consistent after collecting across mechanism, and is based on noted earlier four Individual reason, diagnostic message are needed during collecting, interact, collect, analyze by substantial amounts of code standardized work.
The content of the invention
The purpose of the present invention is exactly to solve the above problems, there is provided a kind of disease automatic coding and system, according to Doctor inputs diagnosis, and with reference to the parsing participle and semantic understanding to case history, automatic reference standard diagnostics library ICD-10 is encoded; Its correct possibility can be assessed coding result.
To achieve these goals, the present invention adopts the following technical scheme that:
Disease automatic coding, comprises the following steps:
Step (1):Receive input data:The input data includes:Raw diagnostic data and patient file data;
Step (2):Raw diagnostic data and patient file data to input pre-process;
Step (3):The pre-processed results obtained with step (2) are in GB/T 14396-2016《Classification of diseases and code》And Retrieved in international disease criterion sorting code number ICD-10, judge whether to obtain result, if obtaining result, direct exports coding As a result;If it is not, into step (4);
Step (4):Word segmentation processing, association's conversion processing and search matching tree are carried out to pretreated raw diagnostic data Processing, optimal result then is filtered out from the result of matching tree, judges whether it is optimal result, if so, then entering step Suddenly (7);If not optimal result, is judged as whether current word segmentation processing result contains two and its above disease name, if Then enter step (5), otherwise into step (6);
Step (5):The result of step (4) is split as several single disease names, for each single Disease name carries out word segmentation processing, association's conversion processing and search matching tree processing, is then sieved from the result of matching tree Optimal result is selected, into step (7);
Step (6):Patient file data progress word segmentation processing, unstructured data to step (1) input are converted to knot Structure data, association's conversion processing and search matching tree processing, then filter out optimal result from the result of matching tree, Into step (7);
Step (7):The coding accuracy of assessment result, exports coding result and accuracy evaluation result.
The pretreatment of the step (2) includes:Punctuation mark is removed, variant Chinese character is converted into upright letters, by double byte character Be converted to half-angle character.
The word segmentation processing, refers to:Will sentence segmentation be segmented for several diagnosis keywords, the diagnosis keyword, Including representing the qualifier for limiting modification and the primary keyword for representing disease;The qualifier refers to descriptive nature, position or journey Spend the word of type;The primary keyword refers to the word for describing disease, abnormal structure, abnormal body or abnormal symptom;
Association's conversion processing, refers to:Obtained qualifier will be segmented and primary keyword is marked in medical semantic network On, association's conversion is carried out respectively to qualifier and primary keyword using medical semantic network, by former primary keyword and former main key The new primary keyword obtained after word association conversion is carried out with the new qualifier obtained after former qualifier and former qualifier association conversion Permutation and combination, finally give all combinations between primary keyword and qualifier in raw diagnostic data;
Such as:Basal ganglia infarction, it is Basal ganglia and infarct after word segmentation processing, is converted by semantic network, Basal ganglia association Basal ganglia, brain stem and brain are converted into, infarct association is converted into infarct and infraction, then the knot after this two keywords and conversion The result of fruit combination includes:Combination 1:Basal ganglia _ infraction, combination 2:Basal ganglia _ infarct, combination 3:Brain stem _ infraction, combination 4:Brain Dry _ infarct, combination 5:Brain _ infraction, combination 6:Brain _ infarct.Combination 1 to 6 is exactly all combinations.
The search matching tree processing, refers to:Institute between the primary keyword and qualifier that are obtained according to association's conversion processing Have combination, searched out from matching forest and the matching tree of leaf is completely covered corresponding to each combination, result be a matching set, Several matchings are set or without result;
It is described to filter out optimal result from the result of matching tree, refer to:
Step (a1):The keyword quantity on matching tree that will match to carries out descending arrangement and compared respectively, if ranking First identical with second or have with the identical to rank the first it is multiple, then by the keyword quantity split with matching tree Keyword ratio of number carry out ascending order arrangement compare;If ranking the first of obtaining identical with second or with ranking the first Identical have it is multiple, then into step (a2);
Step (a2):Qualifier and primary keyword are overlapped in the conversion distance of medical semantic network, superposition is tied Fruit carries out descending arrangement and compared, if rank the first identical with second or having with the identical to rank the first multiple, enters Enter step (a3);
Step (a3):The matching degree of matching tree is calculated, the matching degree of the matching tree is equal to the main pass that participle obtains The quantity of keyword and the primary keyword ratio of number for matching tree, descending arrangement is carried out to ratio of number and is compared;If rank the first It is identical with second or have with the identical to rank the first multiple, then terminate;
In step (a1)-step (a3), if optimal result only has one, that is, the result to rank the first only has one (second place and follow-up all different from first place), then it represents that current matching tree is Optimum Matching tree.
It is described to judge whether current word segmentation processing result contains two and its above disease name, basis for estimation be word it Between conjunction whether there is, if conjunction be present, then it represents that current word segmentation processing result contains two and its above disease name, if In the absence of conjunction, then it represents that current word segmentation processing result does not contain two and its above disease name.
In the step (6), case history text is non-structured text, after carrying out word segmentation processing to patient file data, Unstructured data after word segmentation processing is converted into structural data, structural data is stored classifiedly according to generic, Generic includes:Personnel, organ, time, place, frequency, symptom, operation, medicine, medical history, divide from structural data is corresponding The information related to diagnosis is extracted in class as supplement keyword;Association's conversion processing and search are carried out to supplement keyword again Handled with tree, then optimal result is filtered out from the result of matching tree, into step (7);It is described related to diagnosis Information includes:Family history, Genetic history, disease property and time in pregnancy period;
The coding accuracy of the assessment result, exist from the matching degree of result and raw diagnostic data, diagnosis keyword Converting for medical semantic network diagnoses order of the keyword with diagnosing keyword in standard diagnostics in distance, raw diagnostic data Three angles of otherness are assessed;
The matching degree of the result and raw diagnostic data, it is:In all matching tree results matched, count first Calculate every group of qualifier being syncopated as and primary keyword total quantity and set the ratio between the qualifier included and primary keyword total quantity with matching Value, as the first ratio;Secondly the ratio of number of primary keyword of the primary keyword quantity being syncopated as with matching tree is calculated, is Second ratio;The matching degree of second ratio and the first ratio, as result and raw diagnostic data.
The diagnosis keyword is in the conversion distance of medical semantic network:Keyword will each be diagnosed medical semantic Network is transformed into the path length that diagnosis keyword corresponding to matching tree passes through and is denoted as a transformation ratio, calculates all match Diagnosis keyword transformation ratio natural logrithm sum, as diagnose keyword medical semantic network conversion distance.
By taking basal ganglia infarction as an example, correspond in cerebral infarction, Basal ganglia is transformed into brain, and conversion weight is 0.3, infarct conversion To infarct, conversion weight is 1, then using the form calculus weighted superposition result of natural logrithm sum as ln (0.3)+ln (1)=- 1.204。
The otherness of order of the keyword with diagnosing keyword in ICD-10 is diagnosed in the raw diagnostic data, is:It is first First calculate same position of the diagnosis keyword in raw diagnostic data and in ICD-10 position sequence difference, then calculate The absolute value sum of the sequence difference of all diagnosis keywords.
As a result coding accuracy evaluation formula:
Y=wTX+b;
Wherein, y is estimation accuracy, and X is vector (x1,x2,x3), wherein x1Represent of result and raw diagnostic data With degree, x2Represent diagnosis keyword in the conversion distance of medical semantic network, x3Represent to diagnose keyword in raw diagnostic data With the otherness for the order that keyword is diagnosed in ICD-10.
Represent three process datas, w in matching processTFor vectorial w transposition, w vectors are (w1,w2,w3), wherein, w1, w2,w3, b is constant.
In the step (1),
Raw diagnostic data, scope include:The discharge diagnosis of diagnosis, first page of illness case in patient file, pathological diagnosis with And the external cause diagnosis of Injuries and poisoning;
Patient file data, including:From first page of illness case, enter discharge record, progress note, operation record, pathological replacement or Examine audit report, side information.
The side information includes:Age, sex, site of pathological change, disease property, peri-operation period, hospital infection disease, sheet It is secondary diagnosis and treatment purpose, main diagnostic message, inspection, pathology, imaging information, familial, heredity, old, sequelae, congenital Property disease, operation or the mode of production.
Qualifier, including:Position, disease property, orientation, disease parting, degree etc., for example, it is left side, right side, acute, first Nature, icteric, leaf etc. on lung.
Keyword is diagnosed, including:Disease, abnormal structure's composition etc., such as pneumonia, deformity, wandering kidney etc..
The cutting structure formed between disease and disease, such as A diseases cause B diseases with B diseases (parallel construction), A diseases Sick (modification limiting structure), A diseases (B diseases), (progressive structure) etc..
The word segmentation processing, refer to according to GB/T 14396-2016《Classification of diseases and code》And international disease criterion point Class encodes ICD-10 and full cutting is carried out to the raw diagnostic data after data cleansing, and each word is as diagnosis in cutting result Keyword;Keyword is diagnosed, including:Represent the qualifier for limiting modification and the primary keyword for representing disease;
Semantic network is a kind of structured way that knowledge is represented with figure;In a semantic network, information is expressed For one group of node, node is connected with each other by the directed line of one group of tape label, for representing the relation between node.
The medical semantic network is the semantic network of medical field, and the node body of medical semantic network is medical domain Concept, the medical domain concept node are connected with other medical domain concept nodes, and each medical domain concept node is again It is connected with the disease concept form of expression node of itself;Each medical domain concept node is also general with property concept node, degree Read node, position concept node or the connection of body concept node;Relation between the node of the medical semantic network is medical treatment Relation between field concept.
The medical field concept, including:The physiology region of anatomy, body tissue, composition, disease exception, bacterial virus, disease Reason, disease property;
Relation between medical domain concept, including:Correlation, transforming relationship, correlation weight, transforming relationship power The relation of weight and concept to specific manifestation.
Correlation between medical domain concept and concept, including:The including, be affiliated of concept, abstract or specific manifestation;
Transforming relationship between medical domain concept and concept, including:Concept it is close or identical;
By medical semantic network, the association and conversion of real concept, so as to expand the hunting zone of concept and association's model Enclose;And the corresponding specific manifestation of each concept is various informative, and not only include the title of written specification, also cover reality The colloquial title used, thus specification term and the incompatible of practical application of official standard diagnosis of having prevented and conflicting is asked Topic.
More than one for disease quantity in raw diagnostic data, diagnosis cutting result can be divided into two or more portions Point, each part includes a primary keyword and corresponding qualifier.During matching, various pieces are searched as a group input Rope matching tree.Some standard diagnostics include multiple diseases, are made up of so it matches leaf some, and each part has Qualifier and keyword, and include the relation between each several part.Relation includes, with concurrently, leading between described each several part Cause, not with exclusion etc..
Matching forest includes several matching trees, each matching tree, including:Tree root, trunk, branch and leaf;Described Tree root with tree represents diagnosis concept, shows as ICD codings;The trunk of the matching tree represents the performance diagnosis name of diagnosis concept Claim;(usual standard diagnostics only have one for the concrete composition part of the branch expression diagnosis concept performance diagnosis name of the matching tree Individual disease, trunk is one;When shelves standard diagnostics include multiple diseases, trunk is respective amount);The tree of the matching tree Leaf represents the qualifier and primary keyword of the concrete composition part of diagnosis concept performance diagnosis name.
Match the forming process of forest:Single standard diagnostics are a concepts, and concept includes several forms of expression;Often Kind of the form of expression have the structure of oneself, the conceptual entity included and comprising each conceptual entity between correlation;Each Concept that standard diagnostics represent, structure, the conceptual entity included and comprising each conceptual entity between correlation tree Structure represents, is defined as matching tree, and the matching tree of all standard diagnostics forms matching forest, also, is referred to according to icd standard South, in forest is matched, priority and inclusion relation be present between matching tree.
The form of expression, such as:Title.
Every kind of form of expression has the structure of oneself:Side by side, progressive explanation, cause and effect etc.;
Every kind of form of expression has the conceptual entity included of oneself:Symptom, disease, operation etc.;
Every kind of form of expression has the correlation between each conceptual entity included of oneself:Keyword and qualifier, limit Periodical repair decorations etc.;
The root of the matching tree represents diagnosis concept, and its concept shows as ICD codings;Such as:What Meniere disease was stated Concept is that a kind of pathological change is labyrintine hydrops, clinical manifestation be the rotatory vertigo of recurrent exerbation, fluctuation Hearing, Tinnitus and the idiopathic disease of inner ear of the vexed swollen sense of ear.
Due to concept be abstract things, it is necessary to which a unique mark identifies to correspond to, title is one kind of concept Performance, and exactly the corresponding of each disease identifies standard diagnostics coding ICD, also just turns into the unique of each concept naturally Mark, that is, the performance of the concept of diseases.
Such as the concept of Meniere disease expressed above, in standard diagnostics, the ICD of Meniere disease is encoded to H81.000, So this ICD codings H81.000 is exactly the performance of the concept of Meniere disease, meanwhile, the root as the matching tree of Meniere disease.
The trunk of the matching tree represents diagnosis concept performance diagnosis name;Such as:The performance title of Meniere disease concept There are Meniere disease, auditory vertigo and labyrintine hydrops.
The branch of the matching tree represents the concrete composition part of diagnosis concept performance diagnosis name, such as mitral stenosis With tricuspid insufficiency, there are two branches, respectively mitral stenosis and tricuspid insufficiency;
The leaf of the matching tree represents the qualifier of the concrete composition part of diagnosis concept performance diagnosis name and main pass Keyword;Such as:Auditory vertigo, primary keyword are dizziness, and qualifier is auditory.
If conjunction be present, when the result of step (4) is split as into several single disease names, at conjunction pair Word carries out cutting:
For example the structure containing multiple diseases is:
A) qualifier (0 is either multiple) _ primary keyword _ conjunction _ qualifier (0 or multiple) _ primary keyword
B) qualifier (0 is either multiple) _ primary keyword _ qualifier (0 or multiple) _ primary keyword
C) qualifier (0 is either multiple) _ primary keyword _ qualifier (0 or multiple) _ primary keyword
Continue to add multiple qualifiers (0 or multiple) structure as _ primary keyword below.
Word segmentation processing is carried out to medical record data file using natural language processing program ansj_seg, the result after processing is The data of structuring;The word separated is stored classifiedly according to generic, such as is categorized as personnel, organ, the time, place, Frequency, symptom, operation, medicine, medical history etc. store classifiedly the data as structuring;From the data of structuring, extract and examine The information for correlation of breaking, such as:The cause of disease such as perinatal period, bacterium, fungi such as gestation, childbirth, puerperium, family's disease, genetic disease Or the disease property such as congenital, posteriority, damage, the external cause of poisoning, the cytomorphology classification of cancer.
The search matching tree processing, there is three kinds of situations,
The first is no result, and the result of output is sky, then the reason for prompting is without matching result is original diagnostic information Deficiency.
Second is to have result, and the optimal result selected that finally sorts only has one, then using this optimal result as Final matching results export.
The third is that have result, and the optimal result selected has multiple, then output result is sky, and is prompted without matching result The reason for be to have multiple matching degree identical results, and using multiple optimal results as prompt message a part export;Need Re-enter diagnosis and more detailed information is provided in original basis.
Disease automatic coding system, including:Memory, processor and storage are run on a memory and on a processor Computer instruction, when the computer instruction is executed by processor, complete following steps:
Step (1):Receive input data:The input data includes:Raw diagnostic data and patient file data;
Step (2):Raw diagnostic data and patient file data to input pre-process;
Step (3):The pre-processed results obtained with step (2) are in GB/T 14396-2016《Classification of diseases and code》And Retrieved in international disease criterion sorting code number ICD-10, judge whether to obtain result, if obtaining result, direct exports coding As a result;If it is not, into step (4);
Step (4):Word segmentation processing, association's conversion processing and search matching tree are carried out to pretreated raw diagnostic data Processing, optimal result then is filtered out from the result of matching tree, judges whether it is optimal result, if so, then entering step Suddenly (7);If not optimal result, is judged as whether current word segmentation processing result contains two and its above disease name, if Then enter step (5), otherwise into step (6);
Step (5):The result of step (4) is split as several single disease names, for each single Disease name carries out word segmentation processing, association's conversion processing and search matching tree processing, is then sieved from the result of matching tree Optimal result is selected, into step (7);
Step (6):Patient file data progress word segmentation processing, unstructured data to step (1) input are converted to knot Structure data, association's conversion processing and search matching tree processing, then filter out optimal result from the result of matching tree, Into step (7);
Step (7):The coding accuracy of assessment result, exports coding result and accuracy evaluation result.
A kind of computer-readable recording medium, is stored thereon with computer instruction, and the computer instruction is held by processor During row, following steps are completed:
Step (1):Receive input data:The input data includes:Raw diagnostic data and patient file data;
Step (2):Raw diagnostic data and patient file data to input pre-process;
Step (3):The pre-processed results obtained with step (2) are in GB/T 14396-2016《Classification of diseases and code》And Retrieved in international disease criterion sorting code number ICD-10, judge whether to obtain result, if obtaining result, direct exports coding As a result;If it is not, into step (4);
Step (4):Word segmentation processing, association's conversion processing and search matching tree are carried out to pretreated raw diagnostic data Processing, optimal result then is filtered out from the result of matching tree, judges whether it is optimal result, if so, then entering step Suddenly (7);If not optimal result, is judged as whether current word segmentation processing result contains two and its above disease name, if Then enter step (5), otherwise into step (6);
Step (5):The result of step (4) is split as several single disease names, for each single Disease name carries out word segmentation processing, association's conversion processing and search matching tree processing, is then sieved from the result of matching tree Optimal result is selected, into step (7);
Step (6):Patient file data progress word segmentation processing, unstructured data to step (1) input are converted to knot Structure data, association's conversion processing and search matching tree processing, then filter out optimal result from the result of matching tree, Into step (7);
Step (7):The coding accuracy of assessment result, exports coding result and accuracy evaluation result.
Beneficial effects of the present invention:
1. solve doctor's raw diagnostic correspond to standard diagnostics can only be by being accomplished manually, mainly by coder The medical knowledge and coding specification knowledge understood by itself, could complete the problem of this works.It is partial to breach needs By language understanding, the difficulty thought deeply by medical knowledge.Solve the unfettered limitation of term that doctor inputs diagnosis, no doctor Vocabulary standard, which can refer to caused same diagnosis concept, but several diagnosis names and substantial amounts of different specific literary styles, so as to The problem of difficult is compareed with standard diagnostics.
2. efficiently solving the standard diagnostics that each medical institutions use encodes skimble-scamble problem.Autocoding is used Afterwards, raw diagnostic is corresponded on same set of standard diagnostics coding, and criteria for classification is unified, is protected during medical institutions' data exchange Card standard is unified.
3. criteria for classification is stable, it is unstable to solve coder's criteria for classification caused by the description word by raw diagnostic It is fixed, the problem of coding is inconsistent is corresponded to before and after same diagnosis several times.
4. using computer program autocoding, the human resources of flood tide are not only saved, and the very big amplitude of efficiency carries Height, accuracy compares h coding's raising and criteria for classification is unified.Diagnosis number caused by a province (such as Shandong Province) in theory Amount can be completed to encode within several hours.
5. automatic diagnosis coding advantageously ensures that medical treatment, teaching, the data-searching accuracy of scientific research, and disease packet DRGS development.Population health information platform, the health medical treatment data standard of the unified authority of structure, autocoding help performance Power acts on.
6. because autocoding is quick and classification is stable, high-volume in a short time can be achieved original case history is encoded Classification, can be that the big data application of medical field and artificial intelligence quickly prepare and arranges data, based on the field Function plays the role of irreplaceable.
7. coding becomes more meticulous, in cataloged procedure, consider the middle disease for carrying conjunction, the disease with conjunction is torn open Point, multiple single diseases are split into, the fineness of disease code result is so further ensured that, is provided for the use of scientific data Sound assurance.
Brief description of the drawings
Fig. 1 is standard diagnostics Auto-matching flow chart;
Fig. 2 is semantic network structural representation;
Fig. 3 is matching tree construction schematic diagram.
Embodiment
The invention will be further described with embodiment below in conjunction with the accompanying drawings.
As shown in figure 1, disease automatic coding, comprises the following steps:
Step (1):Receive input data:The input data includes:Raw diagnostic data and patient file data;
Step (2):Raw diagnostic data and patient file data to input pre-process;
Step (3):The pre-processed results obtained with step (2) are in GB/T 14396-2016《Classification of diseases and code》And Retrieved in international disease criterion sorting code number ICD-10, judge whether to obtain result, if obtaining result, direct exports coding As a result;If it is not, into step (4);
Step (4):Word segmentation processing, association's conversion processing and search matching tree are carried out to pretreated raw diagnostic data Processing, optimal result then is filtered out from the result of matching tree, judges whether it is optimal result, if so, then entering step Suddenly (7);If not optimal result, is judged as whether current word segmentation processing result contains two and its above disease name, if Then enter step (5), otherwise into step (6);
Step (5):The result of step (4) is split as several single disease names, for each single Disease name carries out word segmentation processing, association's conversion processing and search matching tree processing, is then sieved from the result of matching tree Optimal result is selected, into step (7);
Step (6):Patient file data progress word segmentation processing, unstructured data to step (1) input are converted to knot Structure data, association's conversion processing and search matching tree processing, then filter out optimal result from the result of matching tree, Into step (7);
Step (7):The coding accuracy of assessment result, exports coding result and accuracy evaluation result.
As shown in Fig. 2 medical semantic network is made up of the oriented relation between node and node, node includes conceptual entity And point of specific manifestation, conceptual entity include the disease, region of anatomy, body tissue, composition, disease in the classification of medical field Matter etc..Oriented relation between node include comprising, belong to, specific manifestation, abstract, nearly justice etc..Such as in figure:Uveitis is One conceptual entity, is inclusion relation with conceptual entities such as posterior uveitis, anterior uveitis;It is scorching with pigmented film eye and tunica vasculose It is nearly justice/synonymy;Uveitis belongs to illness in eye simultaneously.
As shown in figure 3, each standard diagnostics matching tree construction includes:Concept is diagnosed, specific manifestation form is standard diagnostics Coding, can be using pictute as tree root;Diagnosis concept specifically show title, can be one be also likely to be it is multiple, can be with shape As being described as trunk;For each title, multiple diseases or referred to as some, each disease or part may be wherein included Can using pictute as trunk, between disease and disease in other words part part between pass of the relationship description between branch System;And keyword possessed by each disease or part can be described as leaf.When the combination of some keywords can be covered completely The keyword (leaf) of lid disease (branch), then mean that matching has suffered this disease;Match simultaneously between each disease Relation character standardization diagnosis in each disease or partial relation, then represent matching in this disease title, Jin Erbiao What is reached is the content of this concept of diseases statement or the subclass of a subdivision.
System and method is included with lower module and algorithm:
1. the characteristic matching network in standard diagnostics storehouse:
Single standard diagnostics essence is a concept, and concept has many forms, is segmented again under every kind of form of expression Connecting each other between conceptual entity, and the conceptual entity of subdivision and structure, the conceptual entity of subdivision is in medical semantic network In there is same or similar conceptual entity to associate and convert again, so, the concept and its knot that each standard diagnostics represent Structure and comprising concept can be represented with the structure of tree, form a matching tree, and the matching tree of all standard diagnostics is formed Match forest.
Matching forest is combined with medical semantic network, constitutes new characteristic matching network:By semantic network, realize Association and conversion, so as to expand search and association's scope, and can completes the matching of standard diagnostics by characteristic matching.
2. diagnosis is split and conceptual entity identification module:
Natural language processing is carried out to raw diagnostic, after character pre-processing, medical concept entity is identified, gives standard Diagnostic characteristic matching network is used to be marked on matching network.
In identification process, medical concept dictionary refines from actual traffic data, thus in medical field ratio In general dictionary is related to more professional and deep.The structure of diagnosis is parsed in medical concept entity procedure is identified, judges to examine Disconnected is reasonable and normative, for lifting accuracy in the matching process.
3. the matching algorithm of diagnosis:
Keyword and structure after splitting will be diagnosed, will be projected on matching network, by the association and conversion of semantic network, The conceptual entity that will likely be expressed is tagged in matching network on semantic network, and then the conceptual entity of these marks passes through matching Tree finds the standard diagnostics for the condition that meets.Standard diagnostics to meeting condition, establishing criteria are diagnosed to original diagnostic information and knot The degree of structure covering, conversion pathway length, and the priority and belonging relation of standard diagnostics, selection are associated on semantic network Go out most suitable matching diagnosis.
4. case history side information extraction module:
Key message and the side information that diagnosis needs to code are extracted from patient file.For example, age bracket, sex, disease Property, peri-operation period etc., and this diagnosis and treatment purpose, main diagnostic message, inspection, pathology, imaging information etc., also hand The information such as art, the mode of production.These information are further as side information all in the case where raw diagnostic obscures or lacks Clarify a diagnosis use.
5. encode accuracy evaluation module:
During matching and diagnosing, by optimal matching result, by coupling path, and raw diagnostic and the mark that matches The information can be caused to cover degree and similarity degree of quasi- diagnosis are recorded.By above-mentioned each factor with different weights collect calculating one it is credible Value, as the foundation for assessing this time correctness of matching.
6. diagnosis and case history input and result output module:
Input module is diagnosed, from interactive interface or electronic health record medical record or enters directly to obtain original examine in case history of leaving hospital It is disconnected.
Diagnosis side information is obtained, it is necessary to which non-structured patient file is segmented from case history, is converted into structuring Patient file, extract wherein necessary information.
As a result output module, it is output in interactive interface or specified file or in database.
The step of autocoding of the present invention, is as follows:
1a obtains the diagnosis of input from interface.
1b is obtained from database and diagnosed, and corresponding diagnosis and therapy recording, patient file.If there is patient file, then hand over Handled by participle program,
Raw diagnostic by diagnostic analysis and conceptual entity identification module, is carried out natural language processing, with medical science semantic net by 2 Based on network, by all possible cutting and identification conceptual entity mode list, and by cutting and recognition result it is unreasonable or Incomplete result progress beta pruning, each conceptual entity then analyzed to rational cutting and recognition result, and in concept Whether the syntactic structure that diagnosis is judged in the structure of entity composition is rational structure, verifies cutting identification conceptual entity in turn Reasonability.
Standard diagnostics matching module below is transferred to be matched from different matching schemes according to the structure after cutting.
3 standard diagnostics matching modules are by the cutting recognition result and structural information of raw diagnostic, in the matching of standard diagnostics On network, scanned for according to matching algorithm.Each diagnosis concept and modification limit concept in search procedure, successively by tool Body surface now arrives concept, concept to association and approximation, includes concept, concept and conceptual combinations to standard diagnostics specific manifestation, standard Diagnose search and conversion pathway of the specific manifestation to standard diagnostics concept.
Meanwhile the degree that includes that can record conversions concepts is searched in conversion process, searching route length, raw diagnostic is split The concept that the concept that goes out includes with end product meet and level of coverage.
If 4 because matching module caused by necessary information missing in raw diagnostic does not reach result, or draws multiple Level of coverage is identical but conceptually differ more diagnosis if, now just need to extract side information from patient file. Patient file is segmented by participle program and is converted into the document of structuring, therefrom extracts the necessary letter related to diagnosis Breath, it is supplemented in the concept of raw diagnostic fractionation, is scanned for again in matching network.
5 accuracy evaluation modules will be searched in matching process, be associated the path of conversion, be searched for the path of matching, original to examine The disconnected concept matching degree for splitting concept and standard diagnostics, the reasonability of raw diagnostic structure and journey similar to standard diagnostics structure Degree, collects calculating by different weights, and the accuracy to code is assessed according to result of calculation.
Although above-mentioned the embodiment of the present invention is described with reference to accompanying drawing, model not is protected to the present invention The limitation enclosed, one of ordinary skill in the art should be understood that on the basis of technical scheme those skilled in the art are not Need to pay various modifications or deformation that creative work can make still within protection scope of the present invention.

Claims (10)

1. disease automatic coding, it is characterized in that, comprise the following steps:
Step (1):Receive input data:The input data includes:Raw diagnostic data and patient file data;
Step (2):Raw diagnostic data and patient file data to input pre-process;
Step (3):The pre-processed results obtained with step (2) are in GB/T 14396-2016《Classification of diseases and code》It is and international Retrieved in disease criterion sorting code number ICD-10, judge whether to obtain result, if obtaining result, direct exports coding knot Fruit;If it is not, into step (4);
Step (4):Pretreated raw diagnostic data is carried out at word segmentation processing, association's conversion processing and search matching tree Reason, optimal result then is filtered out from the result of matching tree, judges whether it is optimal result, if so, then entering step (7);If not optimal result, is judged as whether current word segmentation processing result contains two and its above disease name, if then Into step (5), otherwise into step (6);
Step (5):The result of step (4) is split as several single disease names, for each single disease Title carries out word segmentation processing, association's conversion processing and search matching tree processing, is then filtered out from the result of matching tree Optimal result, into step (7);
Step (6):Patient file data progress word segmentation processing, unstructured data to step (1) input are converted to structuring Data, association's conversion processing and search matching tree processing, then filter out optimal result from the result of matching tree, enter Step (7);
Step (7):The coding accuracy of assessment result, exports coding result and accuracy evaluation result.
2. disease automatic coding as claimed in claim 1, it is characterized in that, the word segmentation processing, refer to:Will language be segmented Sentence cutting is that several diagnosis keywords, the diagnosis keyword, including expression limit the qualifier of modification and represent disease Primary keyword;The qualifier refers to the word of descriptive nature, position or degree type;The primary keyword refers to describe disease Disease, abnormal structure, the word of abnormal body or abnormal symptom;
Association's conversion processing, refers to:Obtained qualifier will be segmented and primary keyword is marked on medical semantic network, profit Association's conversion is carried out respectively to qualifier and primary keyword with medical semantic network, former primary keyword and former primary keyword are associated The new primary keyword obtained after conversion carries out arrangement group with the new qualifier obtained after former qualifier and former qualifier association conversion Close, finally give all combinations between primary keyword and qualifier in raw diagnostic data;
The search matching tree processing, refers to:All groups between the primary keyword and qualifier that are obtained according to association's conversion processing Close, the matching tree that leaf is completely covered corresponding to each combination is searched out from matching forest, result is that a matching is set, be some Individual matching is set or without result.
3. disease automatic coding as claimed in claim 1, it is characterized in that, it is described to be screened from the result of matching tree Go out optimal result, refer to:
Step (a1):The keyword quantity on matching tree that will match to carries out descending arrangement and compared respectively, if ranked the first It is identical with second or have with the identical to rank the first it is multiple,
Compared with the keyword quantity split then is carried out into ascending order arrangement with the keyword ratio of number of matching tree;If obtain Rank the first identical with second or have with the identical to rank the first it is multiple, then into step (a2);
Step (a2):Qualifier and primary keyword are overlapped in the conversion distance of medical semantic network, stack result is entered The arrangement of row descending is compared, if rank the first identical with second or having with the identical to rank the first multiple, enters step Suddenly (a3);
Step (a3):The matching degree of matching tree is calculated, the matching degree of the matching tree is equal to the primary keyword that participle obtains Quantity with match tree primary keyword ratio of number, to ratio of number carry out descending arrangement compare;If rank the first with Second is identical or have with the identical to rank the first multiple, then terminates;
In step (a1)-step (a3), if optimal result only has one, that is, the result to rank the first only has one, then table It is Optimum Matching tree to show current matching tree.
4. disease automatic coding as claimed in claim 1, it is characterized in that,
Described to judge whether current word segmentation processing result contains two and its above disease name, basis for estimation is between word Conjunction whether there is, if conjunction be present, then it represents that current word segmentation processing result contains two and its above disease name, if not depositing In conjunction, then it represents that current word segmentation processing result does not contain two and its above disease name;
In the step (6), case history text is non-structured text, after carrying out word segmentation processing to patient file data, will be divided Unstructured data after word processing is converted to structural data, and structural data is stored classifiedly according to generic, affiliated Classification includes:Personnel, organ, time, place, frequency, symptom, operation, medicine, medical history, from the corresponding classification of structural data The information related to diagnosis is extracted as supplement keyword;Association's conversion processing and search matching tree are carried out to supplement keyword again Processing, optimal result then is filtered out from the result of matching tree, into step (7);The information related to diagnosis Including:Family history, Genetic history, disease property and time in pregnancy period.
5. disease automatic coding as claimed in claim 1, it is characterized in that,
The coding accuracy of the assessment result, from the matching degree of result and raw diagnostic data, diagnosis keyword in medical treatment Semantic network converts the difference that order of the keyword with diagnosing keyword in standard diagnostics is diagnosed in distance, raw diagnostic data Three angles of property are assessed;
The matching degree of the result and raw diagnostic data, it is:In all matching tree results matched, calculate first every Qualifier and primary keyword total quantity that group is syncopated as set the ratio of the qualifier included and primary keyword total quantity with matching, i.e., For the first ratio;Secondly calculate the primary keyword quantity being syncopated as with match tree primary keyword ratio of number, as second Ratio;The matching degree of second ratio and the first ratio, as result and raw diagnostic data;
The diagnosis keyword is in the conversion distance of medical semantic network:Keyword will each be diagnosed in medical semantic network Be transformed into the path length that diagnosis keyword corresponding to matching tree passes through and be denoted as a transformation ratio, calculate it is all match examine The transformation ratio natural logrithm sum of disconnected keyword, as diagnose conversion distance of the keyword in medical semantic network;
The otherness of order of the keyword with diagnosing keyword in ICD-10 is diagnosed in the raw diagnostic data, is:Count first Calculate same position of the diagnosis keyword in raw diagnostic data and in ICD-10 position sequence difference, then calculate all Diagnose the absolute value sum of the sequence difference of keyword.
6. disease automatic coding as claimed in claim 1, it is characterized in that, in the step (1),
Raw diagnostic data, scope include:Discharge diagnosis, pathological diagnosis and the damage of diagnosis, first page of illness case in patient file The external cause diagnosis of wound poisoning;
Patient file data, including:From first page of illness case, enter discharge record, progress note, operation record, pathological replacement or inspection Audit report, side information;
The side information includes:Age, sex, site of pathological change, disease property, peri-operation period, hospital infection disease, this examines Treat purpose, main diagnostic message, inspection, pathology, imaging information, familial, heredity, old, sequelae, congenital disease Disease, operation or the mode of production;
Qualifier, including:Position, disease property, orientation, disease parting, degree;
Keyword is diagnosed, including:Disease, abnormal structure's composition.
7. disease automatic coding as claimed in claim 1, it is characterized in that,
The medical semantic network is the semantic network of medical field, and the node body of medical semantic network is general for medical domain Read, the medical domain concept node is connected with other medical domain concept nodes, each medical domain concept node but with The disease concept form of expression node connection of itself;Each medical domain concept node also with property concept node, degree concept Node, position concept node or the connection of body concept node;Relation between the node of the medical semantic network is medical treatment neck Relation between the concept of domain;
The medical field concept, including:The physiology region of anatomy, body tissue, composition, disease exception, bacterial virus, pathology, Disease property;
Relation between medical domain concept, including:Correlation, transforming relationship, correlation weight, transforming relationship weight and Relation of the concept to specific manifestation;
Correlation between medical domain concept and concept, including:The including, be affiliated of concept, abstract or specific manifestation;
Transforming relationship between medical domain concept and concept, including:Concept it is close or identical.
8. disease automatic coding as claimed in claim 1, it is characterized in that,
More than one for disease quantity in raw diagnostic data, diagnosis cutting result can be divided into two or more parts, often Individual part includes a primary keyword and corresponding qualifier;During matching, various pieces scan for as the input of group With tree;
Matching forest includes several matching trees, each matching tree, including:Tree root, trunk, branch and leaf;The matching tree Tree root represent diagnosis concept, show as ICD coding;The trunk of the matching tree represents the performance diagnosis name of diagnosis concept; The branch of the matching tree represents the concrete composition part of diagnosis concept performance diagnosis name;The leaf of the matching tree represents to examine The qualifier and primary keyword of the concrete composition part of disconnected concept performance diagnosis name;
Match the forming process of forest:Single standard diagnostics are a concepts, and concept includes several forms of expression;Every kind of table Existing form have the structure of oneself, the conceptual entity included and comprising each conceptual entity between correlation;Each standard Diagnose represent concept, structure, the conceptual entity included and comprising each conceptual entity between correlation tree structure To represent, matching tree is defined as, and the matching tree of all standard diagnostics forms matching forest, also, according to icd standard guide, In forest is matched, priority and inclusion relation be present between matching tree;
The form of expression, including:Title;
Every kind of form of expression has the structure of oneself, including:Side by side, progressive explanation or cause and effect;
Every kind of form of expression has the conceptual entity included of oneself, including:Symptom, disease or operation;
Every kind of form of expression has the correlation between each conceptual entity included of oneself, including:Keyword and qualifier or Limit modification;
The root of the matching tree represents diagnosis concept, and its concept shows as ICD codings;
The search matching tree processing, there is three kinds of situations,
The first is no result, and the result of output is sky, then the reason for prompting is without matching result is original diagnostic information deficiency;
Second is to have result, and the optimal result selected that finally sorts only has one, then using this optimal result as final Matching result exports;
The third is that have result, and the optimal result selected has multiple, then output result is sky, and prompts the original without matching result Exported because being there are multiple matching degree identical results, and using multiple optimal results as a part for prompt message;Need weight New input diagnoses and more detailed information is provided in original basis.
9. disease automatic coding system, it is characterized in that, including:Memory, processor and storage are on a memory and in processor The computer instruction of upper operation, when the computer instruction is executed by processor, complete following steps:
Step (1):Receive input data:The input data includes:Raw diagnostic data and patient file data;
Step (2):Raw diagnostic data and patient file data to input pre-process;
Step (3):The pre-processed results obtained with step (2) are in GB/T 14396-2016《Classification of diseases and code》It is and international Retrieved in disease criterion sorting code number ICD-10, judge whether to obtain result, if obtaining result, direct exports coding knot Fruit;If it is not, into step (4);
Step (4):Pretreated raw diagnostic data is carried out at word segmentation processing, association's conversion processing and search matching tree Reason, optimal result then is filtered out from the result of matching tree, judges whether it is optimal result, if so, then entering step (7);If not optimal result, is judged as whether current word segmentation processing result contains two and its above disease name, if then Into step (5), otherwise into step (6);
Step (5):The result of step (4) is split as several single disease names, for each single disease Title carries out word segmentation processing, association's conversion processing and search matching tree processing, is then filtered out from the result of matching tree Optimal result, into step (7);
Step (6):Patient file data progress word segmentation processing, unstructured data to step (1) input are converted to structuring Data, association's conversion processing and search matching tree processing, then filter out optimal result from the result of matching tree, enter Step (7);
Step (7):The coding accuracy of assessment result, exports coding result and accuracy evaluation result.
10. a kind of computer-readable recording medium, it is characterized in that, it is stored thereon with computer instruction, the computer instruction quilt During computing device, following steps are completed:
Step (1):Receive input data:The input data includes:Raw diagnostic data and patient file data;
Step (2):Raw diagnostic data and patient file data to input pre-process;
Step (3):The pre-processed results obtained with step (2) are in GB/T 14396-2016《Classification of diseases and code》It is and international Retrieved in disease criterion sorting code number ICD-10, judge whether to obtain result, if obtaining result, direct exports coding knot Fruit;If it is not, into step (4);
Step (4):Pretreated raw diagnostic data is carried out at word segmentation processing, association's conversion processing and search matching tree Reason, optimal result then is filtered out from the result of matching tree, judges whether it is optimal result, if so, then entering step (7);If not optimal result, is judged as whether current word segmentation processing result contains two and its above disease name, if then Into step (5), otherwise into step (6);
Step (5):The result of step (4) is split as several single disease names, for each single disease Title carries out word segmentation processing, association's conversion processing and search matching tree processing, is then filtered out from the result of matching tree Optimal result, into step (7);
Step (6):Patient file data progress word segmentation processing, unstructured data to step (1) input are converted to structuring Data, association's conversion processing and search matching tree processing, then filter out optimal result from the result of matching tree, enter Step (7);
Step (7):The coding accuracy of assessment result, exports coding result and accuracy evaluation result.
CN201711013515.5A 2017-10-25 2017-10-25 Disease automatic coding method and system Active CN107705839B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711013515.5A CN107705839B (en) 2017-10-25 2017-10-25 Disease automatic coding method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711013515.5A CN107705839B (en) 2017-10-25 2017-10-25 Disease automatic coding method and system

Publications (2)

Publication Number Publication Date
CN107705839A true CN107705839A (en) 2018-02-16
CN107705839B CN107705839B (en) 2020-06-26

Family

ID=61181536

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711013515.5A Active CN107705839B (en) 2017-10-25 2017-10-25 Disease automatic coding method and system

Country Status (1)

Country Link
CN (1) CN107705839B (en)

Cited By (35)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108710636A (en) * 2018-04-09 2018-10-26 平安科技(深圳)有限公司 A kind of method, terminal device and the computer readable storage medium of screening case history
CN108920453A (en) * 2018-06-08 2018-11-30 医渡云(北京)技术有限公司 Data processing method, device, electronic equipment and computer-readable medium
CN109243599A (en) * 2018-03-16 2019-01-18 申朴信息技术(上海)股份有限公司 A kind of disease based on various dimensions information retrieval is to code method
CN109256216A (en) * 2018-08-14 2019-01-22 平安医疗健康管理股份有限公司 Medical data processing method, device, computer equipment and storage medium
CN109448860A (en) * 2018-09-10 2019-03-08 平安科技(深圳)有限公司 Disease data mapping method, device, computer equipment and storage medium
CN109493931A (en) * 2018-10-25 2019-03-19 平安科技(深圳)有限公司 A kind of coding method of patient file, server and computer readable storage medium
CN109543718A (en) * 2018-10-27 2019-03-29 平安医疗健康管理股份有限公司 The method and apparatus for modifying disease description
CN109698016A (en) * 2018-12-11 2019-04-30 中国科学院深圳先进技术研究院 Disease automatic coding and device
CN109754854A (en) * 2019-01-14 2019-05-14 上海市内分泌代谢病研究所 A kind of diagnosis coding and the matched method and system of diagnosis name
CN109993227A (en) * 2019-03-29 2019-07-09 京东方科技集团股份有限公司 Method, system, device and the medium of automatic addition International Classification of Diseases coding
CN109994215A (en) * 2019-04-25 2019-07-09 清华大学 Disease automatic coding system, method, equipment and storage medium
CN110265102A (en) * 2018-07-13 2019-09-20 首都医科大学附属北京友谊医院 Data processing method, device and electronic equipment
CN110491465A (en) * 2019-08-20 2019-11-22 山东众阳健康科技集团有限公司 Classification of diseases coding method, system, equipment and medium based on deep learning
WO2019238105A1 (en) * 2018-06-16 2019-12-19 深圳市前海安测信息技术有限公司 Medical insurance interface code matching method, computer device and readable storage medium
CN110895580A (en) * 2019-12-12 2020-03-20 山东众阳健康科技集团有限公司 ICD operation and operation code automatic matching method based on deep learning
CN110968234A (en) * 2018-09-28 2020-04-07 智龄科技股份有限公司 Intelligent nursing text generation system and intelligent nursing information platform using same
CN111091881A (en) * 2019-12-28 2020-05-01 北京颐圣智能科技有限公司 Medical information classification method, medical classification information storage method and computing device
CN111180024A (en) * 2019-12-13 2020-05-19 平安医疗健康管理股份有限公司 Data processing method and device based on word frequency and inverse document frequency and computer equipment
CN111192667A (en) * 2019-12-16 2020-05-22 山东众阳健康科技集团有限公司 Method for prompting operation code based on intelligent operation
CN111276203A (en) * 2020-01-20 2020-06-12 和宇健康科技股份有限公司 Automatic tumor registration and analysis method and system
CN111540468A (en) * 2020-04-21 2020-08-14 重庆大学 ICD automatic coding method and system for visualization of diagnosis reason
CN111863158A (en) * 2020-07-06 2020-10-30 天津市疾病预防控制中心 Automatic coding system for tumor registration
CN112148882A (en) * 2020-11-27 2020-12-29 北京惠及智医科技有限公司 Medical record text processing method and device, electronic equipment and storage medium
CN112183104A (en) * 2020-08-26 2021-01-05 望海康信(北京)科技股份公司 Code recommendation method, system and corresponding equipment and storage medium
CN112560676A (en) * 2020-12-15 2021-03-26 中国平安人寿保险股份有限公司 Image text extraction method, device and storage medium
CN112837765A (en) * 2021-01-12 2021-05-25 山东众阳健康科技集团有限公司 Automatic coding method and system for disease diagnosis
CN112836006A (en) * 2021-01-12 2021-05-25 山东众阳健康科技集团有限公司 Multi-diagnosis intelligent coding method, system, medium and equipment
CN112836512A (en) * 2021-01-27 2021-05-25 山东众阳健康科技集团有限公司 ICD-11 coding retrieval method based on natural semantic processing and knowledge graph
CN112861535A (en) * 2021-01-18 2021-05-28 山东众阳健康科技集团有限公司 Surgery classification coding method and system based on diagnosis and treatment data
CN113436730A (en) * 2021-07-14 2021-09-24 山大地纬软件股份有限公司 Hospital disease diagnosis classification automatic coding method and system
WO2021217850A1 (en) * 2020-04-26 2021-11-04 平安科技(深圳)有限公司 Disease name code matching method and apparatus, computer device and storage medium
CN113657114A (en) * 2021-08-31 2021-11-16 平安医疗健康管理股份有限公司 Method, device, equipment and storage medium for generating disease name code matching list
CN113722418A (en) * 2021-08-30 2021-11-30 平安科技(深圳)有限公司 Clinical case standardization method, device, equipment and medium
CN114255839A (en) * 2022-01-26 2022-03-29 广州天鹏计算机科技有限公司 Medical big data management system and method
CN112560676B (en) * 2020-12-15 2024-11-12 中国平安人寿保险股份有限公司 Image text extraction method, device and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140181128A1 (en) * 2011-03-07 2014-06-26 Daniel J. RISKIN Systems and Methods for Processing Patient Data History
CN105069124A (en) * 2015-08-13 2015-11-18 易保互联医疗信息科技(北京)有限公司 Automatic ICD (International Classification of Diseases) coding method and system
CN105069123A (en) * 2015-08-13 2015-11-18 易保互联医疗信息科技(北京)有限公司 Automatic coding method and system for Chinese surgical operation information
CN105303064A (en) * 2015-12-03 2016-02-03 万达信息股份有限公司 Method for building medical index system based on semantic model
CN106844308A (en) * 2017-01-20 2017-06-13 天津艾登科技有限公司 A kind of use semantics recognition carries out the method for automating disease code conversion
CN106934220A (en) * 2017-02-24 2017-07-07 黑龙江特士信息技术有限公司 Towards the disease class entity recognition method and device of multi-data source

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140181128A1 (en) * 2011-03-07 2014-06-26 Daniel J. RISKIN Systems and Methods for Processing Patient Data History
CN105069124A (en) * 2015-08-13 2015-11-18 易保互联医疗信息科技(北京)有限公司 Automatic ICD (International Classification of Diseases) coding method and system
CN105069123A (en) * 2015-08-13 2015-11-18 易保互联医疗信息科技(北京)有限公司 Automatic coding method and system for Chinese surgical operation information
CN105303064A (en) * 2015-12-03 2016-02-03 万达信息股份有限公司 Method for building medical index system based on semantic model
CN106844308A (en) * 2017-01-20 2017-06-13 天津艾登科技有限公司 A kind of use semantics recognition carries out the method for automating disease code conversion
CN106934220A (en) * 2017-02-24 2017-07-07 黑龙江特士信息技术有限公司 Towards the disease class entity recognition method and device of multi-data source

Cited By (52)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109243599A (en) * 2018-03-16 2019-01-18 申朴信息技术(上海)股份有限公司 A kind of disease based on various dimensions information retrieval is to code method
CN108710636B (en) * 2018-04-09 2024-02-23 平安科技(深圳)有限公司 Medical record screening method, terminal equipment and computer readable storage medium
CN108710636A (en) * 2018-04-09 2018-10-26 平安科技(深圳)有限公司 A kind of method, terminal device and the computer readable storage medium of screening case history
CN108920453A (en) * 2018-06-08 2018-11-30 医渡云(北京)技术有限公司 Data processing method, device, electronic equipment and computer-readable medium
CN108920453B (en) * 2018-06-08 2023-03-24 国家食品药品监督管理总局药品评价中心 Data processing method and device, electronic equipment and computer readable medium
WO2019238105A1 (en) * 2018-06-16 2019-12-19 深圳市前海安测信息技术有限公司 Medical insurance interface code matching method, computer device and readable storage medium
CN110265102A (en) * 2018-07-13 2019-09-20 首都医科大学附属北京友谊医院 Data processing method, device and electronic equipment
CN109256216A (en) * 2018-08-14 2019-01-22 平安医疗健康管理股份有限公司 Medical data processing method, device, computer equipment and storage medium
CN109256216B (en) * 2018-08-14 2023-06-27 平安医疗健康管理股份有限公司 Medical data processing method, medical data processing device, computer equipment and storage medium
CN109448860B (en) * 2018-09-10 2024-09-17 平安科技(深圳)有限公司 Disease data mapping method, device, computer equipment and storage medium
CN109448860A (en) * 2018-09-10 2019-03-08 平安科技(深圳)有限公司 Disease data mapping method, device, computer equipment and storage medium
WO2020052162A1 (en) * 2018-09-10 2020-03-19 平安科技(深圳)有限公司 Disease data mapping method and device, computer device and storage medium
CN110968234A (en) * 2018-09-28 2020-04-07 智龄科技股份有限公司 Intelligent nursing text generation system and intelligent nursing information platform using same
CN110968234B (en) * 2018-09-28 2021-08-10 智龄科技股份有限公司 Intelligent nursing text generation system and intelligent nursing information platform using same
CN109493931A (en) * 2018-10-25 2019-03-19 平安科技(深圳)有限公司 A kind of coding method of patient file, server and computer readable storage medium
CN109543718B (en) * 2018-10-27 2024-04-09 平安医疗健康管理股份有限公司 Method and device for modifying disease type description
CN109543718A (en) * 2018-10-27 2019-03-29 平安医疗健康管理股份有限公司 The method and apparatus for modifying disease description
CN109698016A (en) * 2018-12-11 2019-04-30 中国科学院深圳先进技术研究院 Disease automatic coding and device
CN109754854A (en) * 2019-01-14 2019-05-14 上海市内分泌代谢病研究所 A kind of diagnosis coding and the matched method and system of diagnosis name
CN109993227A (en) * 2019-03-29 2019-07-09 京东方科技集团股份有限公司 Method, system, device and the medium of automatic addition International Classification of Diseases coding
CN109994215A (en) * 2019-04-25 2019-07-09 清华大学 Disease automatic coding system, method, equipment and storage medium
WO2021032219A3 (en) * 2019-08-20 2021-04-15 山东众阳健康科技集团有限公司 Method and system for disease classification coding based on deep learning, and device and medium
CN110491465A (en) * 2019-08-20 2019-11-22 山东众阳健康科技集团有限公司 Classification of diseases coding method, system, equipment and medium based on deep learning
CN110895580B (en) * 2019-12-12 2020-07-07 山东众阳健康科技集团有限公司 ICD operation and operation code automatic matching method based on deep learning
CN110895580A (en) * 2019-12-12 2020-03-20 山东众阳健康科技集团有限公司 ICD operation and operation code automatic matching method based on deep learning
CN111180024B (en) * 2019-12-13 2023-03-21 深圳平安医疗健康科技服务有限公司 Data processing method and device based on word frequency and inverse document frequency and computer equipment
CN111180024A (en) * 2019-12-13 2020-05-19 平安医疗健康管理股份有限公司 Data processing method and device based on word frequency and inverse document frequency and computer equipment
CN111192667A (en) * 2019-12-16 2020-05-22 山东众阳健康科技集团有限公司 Method for prompting operation code based on intelligent operation
CN111091881B (en) * 2019-12-28 2023-12-19 北京颐圣智能科技有限公司 Medical information classification method, medical classified information storage method and computing device
CN111091881A (en) * 2019-12-28 2020-05-01 北京颐圣智能科技有限公司 Medical information classification method, medical classification information storage method and computing device
CN111276203A (en) * 2020-01-20 2020-06-12 和宇健康科技股份有限公司 Automatic tumor registration and analysis method and system
CN111276203B (en) * 2020-01-20 2024-03-01 和宇健康科技股份有限公司 Tumor automatic registration analysis method and system
CN111540468A (en) * 2020-04-21 2020-08-14 重庆大学 ICD automatic coding method and system for visualization of diagnosis reason
CN111540468B (en) * 2020-04-21 2023-05-16 重庆大学 ICD automatic coding method and system for visualizing diagnostic reasons
WO2021217850A1 (en) * 2020-04-26 2021-11-04 平安科技(深圳)有限公司 Disease name code matching method and apparatus, computer device and storage medium
CN111863158A (en) * 2020-07-06 2020-10-30 天津市疾病预防控制中心 Automatic coding system for tumor registration
CN112183104A (en) * 2020-08-26 2021-01-05 望海康信(北京)科技股份公司 Code recommendation method, system and corresponding equipment and storage medium
CN112148882B (en) * 2020-11-27 2021-09-10 北京惠及智医科技有限公司 Medical record text processing method and device, electronic equipment and storage medium
CN112148882A (en) * 2020-11-27 2020-12-29 北京惠及智医科技有限公司 Medical record text processing method and device, electronic equipment and storage medium
CN112560676B (en) * 2020-12-15 2024-11-12 中国平安人寿保险股份有限公司 Image text extraction method, device and storage medium
CN112560676A (en) * 2020-12-15 2021-03-26 中国平安人寿保险股份有限公司 Image text extraction method, device and storage medium
CN112837765B (en) * 2021-01-12 2023-01-20 山东众阳健康科技集团有限公司 Automatic coding method and system for disease diagnosis
CN112836006A (en) * 2021-01-12 2021-05-25 山东众阳健康科技集团有限公司 Multi-diagnosis intelligent coding method, system, medium and equipment
CN112837765A (en) * 2021-01-12 2021-05-25 山东众阳健康科技集团有限公司 Automatic coding method and system for disease diagnosis
CN112836006B (en) * 2021-01-12 2022-09-23 山东众阳健康科技集团有限公司 Multi-diagnostic intelligent coding method, system, medium and equipment
CN112861535A (en) * 2021-01-18 2021-05-28 山东众阳健康科技集团有限公司 Surgery classification coding method and system based on diagnosis and treatment data
CN112861535B (en) * 2021-01-18 2023-11-14 众阳健康科技集团有限公司 Surgical classification coding method and system based on diagnosis and treatment data
CN112836512A (en) * 2021-01-27 2021-05-25 山东众阳健康科技集团有限公司 ICD-11 coding retrieval method based on natural semantic processing and knowledge graph
CN113436730A (en) * 2021-07-14 2021-09-24 山大地纬软件股份有限公司 Hospital disease diagnosis classification automatic coding method and system
CN113722418A (en) * 2021-08-30 2021-11-30 平安科技(深圳)有限公司 Clinical case standardization method, device, equipment and medium
CN113657114A (en) * 2021-08-31 2021-11-16 平安医疗健康管理股份有限公司 Method, device, equipment and storage medium for generating disease name code matching list
CN114255839A (en) * 2022-01-26 2022-03-29 广州天鹏计算机科技有限公司 Medical big data management system and method

Also Published As

Publication number Publication date
CN107705839B (en) 2020-06-26

Similar Documents

Publication Publication Date Title
CN107577826B (en) Classification of diseases coding method and system based on raw diagnostic data
CN107705839A (en) Disease automatic coding and system
CN107731269A (en) Disease code method and system based on raw diagnostic data and patient file data
CN111414393B (en) Semantic similar case retrieval method and equipment based on medical knowledge graph
CN113241135B (en) Disease risk prediction method and system based on multi-modal fusion
US11200379B2 (en) Ontologically driven procedure coding
CN109460473B (en) Electronic medical record multi-label classification method based on symptom extraction and feature representation
US20210034813A1 (en) Neural network model with evidence extraction
CN111222340B (en) Breast electronic medical record entity recognition system based on multi-standard active learning
CN112151188A (en) Intelligent disease prediction system based on medical knowledge map
CN110085308B (en) Diagnosis and treatment department classification method based on fusion deep learning
CN110321556A (en) A kind of method and its system of doctor's diagnosis and treatment medical insurance control expense intelligent recommendation scheme
CN111477320B (en) Treatment effect prediction model construction system, treatment effect prediction system and terminal
CN105677637A (en) Method and device for updating abstract semantics database in intelligent question-answering system
CN117033568A (en) Medical data index interpretation method, device, storage medium and equipment
CN117474507A (en) Intelligent recruitment matching method and system based on big data application technology
CN117235243A (en) Training optimization method for large language model of civil airport and comprehensive service platform
CN109686443A (en) A kind of clinical diagnosis aid decision-making system and medical knowledge map accumulative means
CN111063446A (en) Method, apparatus, device and storage medium for standardizing medical text data
CN111128388A (en) Value domain data matching method and device and related products
CN110232328A (en) A kind of reference report analytic method, device and computer readable storage medium
CN108491487A (en) A kind of clinical guidelines knowledge encoding method and system
CN114254083A (en) Medical special term unsupervised clustering method, device and system
CN111192667A (en) Method for prompting operation code based on intelligent operation
CN110515926A (en) Heterogeneous data source mass data carding method based on participle and semantic dependency analysis

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CP03 Change of name, title or address
CP03 Change of name, title or address

Address after: 250101 12 / F, building 1, Aosheng building, 1166 Xinluo street, hi tech Zone, Jinan City, Shandong Province

Patentee after: SHANDONG MSUNHEALTH TECHNOLOGY GROUP Co.,Ltd.

Address before: Xinluo Avenue high tech Zone of Ji'nan City, Shandong province 250000 No. 1166 orsus No. 1 building, 12 floor

Patentee before: SHANDONG MSUNSOFT Co.,Ltd.

CP01 Change in the name or title of a patent holder
CP01 Change in the name or title of a patent holder

Address after: 250101 Floor 12, Building 1, Aosheng Building, No. 1166, Xinluo Street, High tech Zone, Jinan, Shandong

Patentee after: Zhongyang Health Technology Group Co.,Ltd.

Address before: 250101 Floor 12, Building 1, Aosheng Building, No. 1166, Xinluo Street, High tech Zone, Jinan, Shandong

Patentee before: SHANDONG MSUNHEALTH TECHNOLOGY GROUP Co.,Ltd.