CN109697285B - Hierarchical BilSt Chinese electronic medical record disease coding and labeling method for enhancing semantic representation - Google Patents
Hierarchical BilSt Chinese electronic medical record disease coding and labeling method for enhancing semantic representation Download PDFInfo
- Publication number
- CN109697285B CN109697285B CN201811523661.7A CN201811523661A CN109697285B CN 109697285 B CN109697285 B CN 109697285B CN 201811523661 A CN201811523661 A CN 201811523661A CN 109697285 B CN109697285 B CN 109697285B
- Authority
- CN
- China
- Prior art keywords
- word
- feature
- character
- vector
- bilstm
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16H—HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
- G16H10/00—ICT specially adapted for the handling or processing of patient-related medical or healthcare data
- G16H10/60—ICT specially adapted for the handling or processing of patient-related medical or healthcare data for patient-specific data, e.g. for electronic patient records
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Computing Systems (AREA)
- Software Systems (AREA)
- Molecular Biology (AREA)
- Data Mining & Analysis (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- Mathematical Physics (AREA)
- Evolutionary Computation (AREA)
- Life Sciences & Earth Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Medical Informatics (AREA)
- Primary Health Care (AREA)
- Public Health (AREA)
- Epidemiology (AREA)
- Medical Treatment And Welfare Office Work (AREA)
Abstract
The invention discloses a hierarchical BilSTM Chinese electronic medical record disease coding and labeling method for enhancing semantic expression, which comprises the steps of preprocessing an input electronic medical record text, considering Chinese word composition, extracting character-level feature vector expression by using a BilSTM introducing an attention mechanism to obtain the semantic and word-forming features of a single Chinese character; splicing the character-level word vector representation and the word-level vector representation obtained by word2vec training to obtain a word vector representation with enhanced character features; and taking the text sequence represented by the feature word vector as input, learning the context features in the whole electronic medical record by using the BilSTM again, and calculating the contribution degree of each feature word by adopting an attention mechanism to obtain the text vector representation weighted by the context features, thereby improving the prediction effect. The method is suitable for the disease label classification task based on the Chinese electronic medical record text, and effectively improves the classification effect.
Description
Technical Field
The invention relates to the field of medical informatics, in particular to a hierarchical BilSt Chinese electronic medical record disease coding and labeling method for enhancing semantic representation.
Background
Electronic Health Records (EHRs) have become one of the important data resources in clinical research. The medical data storage system stores various information of patients in the hospitalizing process as digital data, and facilitates the analysis and processing of clinical data by a computer. For an electronic medical record, there is a need for a uniform label specification that describes the disease status of the patient, thereby facilitating the rational classification of patient information to aid in clinical decision-making. International Classification of Diseases (ICD), issued by the world health organization and continuously updated, is an International universal disease coding scheme that is often used as a label for clinical records to identify symptoms, signs, Diseases, abnormal findings or operations, and the like. Currently, the newly revised ICD code version 10 has been widely used in hospital information systems in our country.
Labeling ICD codes on electronic medical records is an important and fundamental task for utilizing electronic medical records. The absence of diagnosis names and ICD codes in electronic medical records is not favorable for the analysis and research of clinical data. Normally, the ICD-coded labeling work is manually determined by medical staff in each hospital case according to the clinical diagnosis description given by a doctor. Manual coding not only requires a coding person to know certain medical knowledge, coding rules and medical terms, but also is time-consuming and labor-consuming. Therefore, the automatic coding by using the computer can provide effective assistance for coding labeling work, and the labeling efficiency of the ICD coding is improved.
At present, most of automatic labeling work of disease codes is carried out based on clinical text data, such as radiology reports, death certificates, discharge knots and the like. However, most research work is focused on english corpus, the work of disease coding prediction on chinese clinical texts is less, and the main method is character string semantic comparison based on diagnosis names. The semantic similarity comparison has high requirements on the quality of the diagnosis name description, and automatic coding labeling cannot be performed under the condition that the diagnosis name is missing. At present, no relevant research work uses a neural network model for a disease coding and labeling task of a Chinese electronic medical record.
The processing of the Chinese electronic medical record text has two characteristics: firstly, the electronic medical record has a long text, and the context information of the long text is difficult to obtain; secondly, Chinese characters are different from English, single Chinese characters also have semantics, and particularly in medical terms, such as direction, body parts and the like are described by one Chinese character, so that the semantics containing character features can better express the semantics of words.
Disclosure of Invention
The invention aims to solve the technical problem that aiming at the defects of the prior art, the invention provides a hierarchical BilSt Chinese electronic medical record disease coding and labeling method for enhancing semantic representation, which finishes automatic labeling in an end-to-end mode and improves the prediction effect.
In order to solve the technical problems, the technical scheme adopted by the invention is as follows:
a hierarchical BilSt Chinese electronic medical record disease coding and labeling method for enhancing semantic representation comprises the following steps:
1) utilizing a Chinese word segmentation tool, introducing a user-defined medical clinical word dictionary for word segmentation, removing stop words, and screening out characteristic words according to word frequency;
2) respectively carrying out character-level and word-level vectorization representation on the feature words, splicing character-level vectors and word-level vectors, and constructing character enhancement feature vector representation of the words;
3) and obtaining the context characteristics of the whole text by utilizing the spliced characteristic words, and calculating the contribution degree of each characteristic word by adopting an attention mechanism to obtain the context characteristic weighting vector representation of the whole text.
In the step 1), the feature words are selected according to the following rules:wherein SfwA set of characteristic words is represented,the expression wiFrequency of (1), NdAnd the total number of samples of the electronic medical record is shown.
In the step 2), the character-level feature vector representation of the bidirectional LSTM training feature words fused with the attention mechanism is utilized, and the word-level vector representation form of the feature words is obtained by utilizing a word vector representation method word2vec based on word distributed representation.
The output mode of the bidirectional long-short term memory network training is as follows:whereinRepresenting the hidden layer output of the forward LSTM at the t-th element or time,then it is the backward LSTM output at the hidden layer of the t-th cell.
The calculation mode of the attention mechanism is as follows:
uij=tanh(Wchij+bc);
hijis the hidden layer output of the jth character of the ith word after the BilSTM training, WcAs a weight matrix, bcAs an offset vector, ucFor randomly initializing a character-level context feature vector, alphaijFor the weight of the jth character to the ith word calculated by the softmax function,the feature vector representation is weighted for the context of the ith word.
In step 3), the method for calculating the context feature weighting vector of the whole text comprises the following steps: and inputting the text represented by the spliced feature word vector into a second-layer bidirectional long-short term memory network, learning to obtain the context features of the whole text, and calculating the weight of each feature word by adopting an attention mechanism to obtain the text feature vector weighted by context information.
The calculation mode of the attention mechanism is as follows:
ui=tanh(Whi+bw);
v=∑iαihi;
hiis character ofThe character of the ith word of the sequence is strengthened and the characteristic vector is trained by the BilSTM to obtain the output of a hidden layer, W is a weight matrix, bwFor a bias vector, when an attention mechanism is applied, a document context feature vector u at a word level is correspondingly introduced and randomly initializedwTo complete the calculation of the weight, alphaiAnd v is represented by the context weighted feature vector of the whole text for the weight corresponding to each word, the vector is input into a full connection layer, and the occurrence probability of each disease code is calculated by a sigmoid function.
Compared with the prior art, the invention has the beneficial effects that: aiming at the characteristics of Chinese, the semantic features of single Chinese characters are integrated into the feature vector representation of words, and the feature words really contributing to the input sequence are weighted by combining an attention mechanism, so that the prediction effect of disease coding is improved; the method is suitable for Chinese clinical text data, text features are automatically extracted by utilizing a neural network model, and automatic labeling is completed in an end-to-end mode.
Drawings
FIG. 1 is a flow chart of the present invention;
FIG. 2 is a hierarchical BilSTM feature learning model incorporating a focus mechanism;
FIG. 3 concerns the computation of a mechanism; (a) h is to beijBecome uij(ii) a (b) Computing each u using context feature vectorsijThe weight of (c); (c) h isijObtaining the feature vector representation of the application attention mechanism by the weighted summation;
FIG. 4 is a graph showing the results of an experiment performed in the present invention.
Detailed Description
Preprocessing of clinical text data
Utilizing a Chinese word segmentation tool 'knot' and a user-defined medical word library to segment input discharge summary texts, removing stop words, counting word frequencies of effective words, selecting characteristic words after ordering from large to small based on the word frequencies, and selecting the characteristic words according to the following rules:wherein SfwRepresenting a set of feature words,The expression wiFrequency of (1), NdRepresenting the total number of electronic medical records.
Second, word vector representation of feature words
1) Character-based word vector representation
Firstly, initializing a vector representation for each character, then inputting a BilSTM fused with a focus mechanism, training to obtain a character-level word vector representation of each characteristic word, and obtaining a state value c of each neural unit in the BilSTMtAnd the output value htThe specific calculation process is (t ═ 1, 2.., n, t denotes the t-th neural unit in the network or the neural unit at time t):
it=sigmoid(Wi[xt;ht-1]+bi) (1)
ft=sigmoid(Wf[xt;ht-1]+bf) (2)
gt=tanh(Wg[xt;ht-1]+bg) (3)
ot=sigmoid(Wo[xt;ht-1]+bo) (4)
ct=ft*ct-1+it*gt (5)
ht=ot*tanh(ct) (6)
each neural unit comprises an input gate i, an output gate o, a forgetting gate f, a storage unit g, a state-saving unit c and a hidden state h which are vectors Wi,Wf,Wg,WoAs a weight matrix, bi,bf,bg,boIs an offset vector, "; "represents a join operation," + "represents a dot product of elements, and sigmoid function is calculated asthe tan h function is calculated asThe output mode of the BilSTM is
2) Application of attention mechanism
The calculation method of the attention mechanism comprises the following steps:
uij=tanh(Wchij+bc) (7)
hijis the hidden layer output of the jth character of the ith word after the BilSTM training, WcAs a weight matrix, bcAs an offset vector, ucFor randomly initializing a character-level context feature vector, alphaijI.e. the weight of the jth character to the ith word calculated by the softmax function,i.e. a context weighted feature vector representation of the ith word.
3) And splicing the character-level word vector obtained by training with the word vector generated by using word2vec to obtain a word feature vector with enhanced character-level context features.
Third, context feature extraction
Inputting the character-reinforced feature vector sequence into a BilSTM of a second-layer fusion attention mechanism, extracting text context information features, calculating a BilSTM neural unit and calculating context feature weighting, wherein the calculation is the same as that when a character-level word vector is expressed, and a specific calculation formula is as follows:
ui=tanh(Whi+bw) (10)
v=∑iαihi (12)
hithe method is characterized in that the character reinforcing feature vector of the ith word of a text sequence is output of a hidden layer obtained after BiLSTM training, W is a weight matrix, b iswFor a bias vector, when an attention mechanism is applied, a document context feature vector u at a word level is correspondingly introduced and randomly initializedwTo complete the calculation of the weight, alphaiAnd v is represented by the context weighted feature vector of the whole text for the weight corresponding to each word, the vector is input into a full connection layer, and the occurrence probability of each disease code is calculated by a sigmoid function.
Fourth, experimental verification
1) Procedure of experiment
In order to verify the effectiveness of the method, experimental verification is carried out on the clinical data of the real Chinese electronic medical record. The data set comprises 7732 discharge records and 1177 ICD-10 disease code labels, wherein the ICD-10 codes comprise point-division six-digit codes consisting of letters and numbers, the letters begin with the first three-digit codes which are primary codes and indicate disease categories. The average length of the discharge node is 610 words, and each discharge node corresponds to 3.6 disease codes on average.
The experiment was done on a server that contained 256GB memory and NVIDIA GeForce Titan X Pascal CUDA GPU processor. We split the data set into training and test sets on a 9:1 scale and verified by randomizing the data ten times. The evaluation index selects the precision (P) of micro-average, recall rate (R) and index F1 value of the combination of the precision (P) and the recall rate (R), and the Hamming loss value of the false alarm condition from the sample perspective. Higher values of F1 and lower Hamming loss values indicate better model performance.
2) Results of the experiment
As related research work indicates that the deep learning method is superior to the traditional machine learning method, we mainly carry out comparison experiments with other common neural network models, and the results are shown in Table 1, wherein MA-BilSTM represents our model, D2V + CNN is the method in related research work, and the method achieves the best effect on the open English data set MIMIC III at present. The experimental result shows that the MA-BilSTM is superior to other neural network models in all evaluation indexes, and the BilSTM combined with the attention mechanism can effectively capture the context information characteristics of long texts and improve the prediction effect.
TABLE 1 comparative experimental results
Model | Micro_P(CI:95%) | Micro_R(CI:95%) | Micro_F1(CI:95%) | hLoss(CI:95%) |
CBOW | 0.614(±6.43e-03) | 0.522(±5.30e-03) | 0.564(±4.52e-03) | 0.00248(±3.14e-05) |
CNN | 0.647(±6.67e-03) | 0.509(±6.51e-03) | 0.569(±4.71e-03) | 0.00237(±3.52e-05) |
D2V+CNN | 0.661(±9.57e-03) | 0.514(±8.74e-03) | 0.579(±7.14e-03) | 0.00231(±3.70e-05) |
MA-BiLSTM | 0.704(±1.13e-02) | 0.586(±5.84e-03) | 0.639(±4.45e-03) | 0.00204(±3.47e-05) |
To analyze the effect of each module of the model, we designed an ablation experiment for analysis, with the results shown in table 2. From experimental results, only word vectors or character vectors represent the characteristics of words in the text, and prediction results are reduced, so that word vector representation enhanced by the character vectors indeed brings better text characteristic representation. The attention mechanism plays an important role in the model, and the performance of the model is obviously reduced by removing the attention mechanism.
Prediction is performed on both ICD-10 full code and primary code, 7732 samples corresponding to 488 primary codes. The results of the experiment are shown in FIG. 4. The prediction result on the primary coding reaches 80.5 percent in precision, and the method can better assist the disease coding labeling work of medical staff in a medical record room.
TABLE 2 model ablation experimental results
Claims (4)
1. A hierarchical BilSt Chinese electronic medical record disease coding and labeling method for enhancing semantic representation is characterized by comprising the following steps:
1) utilizing a Chinese word segmentation tool, introducing a user-defined medical clinical word dictionary to segment words of the discharge summary text, removing stop words, and screening out characteristic words according to word frequency;
2) respectively carrying out character level and word level vectorization representation on the feature words, splicing the character level vectors and the word level vectors, and constructing character enhancement feature vector representation of the words; using character-level feature vector representation of a BilSTM training feature word fused with an attention mechanism, and using a word vector representation method word2vec based on word distributed representation to obtain a word-level vector representation form of the feature word;
3) obtaining a word vector representation sequence of the whole text by using the spliced feature words, calculating the contribution degree of each feature word by using an attention mechanism, obtaining context feature weighted vector representation of the whole text, namely inputting the text represented by the spliced feature word vector into a second-layer bidirectional long-short term memory network, learning to obtain context features of the whole text, and calculating the weight of each feature word by using the attention mechanism, so as to obtain a text feature vector weighted by context information;
the calculation mode of the attention mechanism is as follows:
v=∑iαihi;
hiis the output of a hidden layer obtained after the character reinforcing characteristic vector of the ith word of the text sequence is trained by BilSTM, W is a weight matrix, bwFor a bias vector, when an attention mechanism is applied, a document context feature vector u at a word level is correspondingly introduced and randomly initializedwTo complete the calculation of the weight value, αiAnd v is represented by the context weighted feature vector of the whole text for the weight corresponding to each word, the vector is input into a full connection layer, and the occurrence probability of each disease code is calculated by a sigmoid function.
2. The method for disease-coding and labeling of BiLSTM Chinese electronic medical record with enhanced semantic representation according to claim 1, wherein in step 1), the feature words are selected according to the following rules:wherein SfwA set of characteristic words is represented,the expression wiFrequency of (N), NdAnd the total number of samples of the electronic medical record is shown.
3. The hierarchical BilSTM Chinese electronic medical record disease coding labeling method based on enhanced semantic representation according to claim 1, wherein the output mode of the BilSTM is as follows:whereinRepresenting the hidden layer output of the forward LSTM at the t-th element or time t,then it is output at the hidden layer of the t-th cell for backward LSTM.
4. The method for disease coding and labeling of BiLSTM Chinese electronic medical record with enhanced semantic representation according to claim 1, wherein in step 2), the calculation mode of the attention mechanism is as follows:
uij=tanh(Wchij+bc);
hijis the output of the j character of the i word in the hidden layer after the BilSTM training, WcAs a weight matrix, bcAs an offset vector, ucFor the random initialization of the character-level context feature vector, αijFor the weight of the jth character to the ith word calculated by the softmax function,the feature vector representation is weighted for the context of the ith word.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811523661.7A CN109697285B (en) | 2018-12-13 | 2018-12-13 | Hierarchical BilSt Chinese electronic medical record disease coding and labeling method for enhancing semantic representation |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811523661.7A CN109697285B (en) | 2018-12-13 | 2018-12-13 | Hierarchical BilSt Chinese electronic medical record disease coding and labeling method for enhancing semantic representation |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109697285A CN109697285A (en) | 2019-04-30 |
CN109697285B true CN109697285B (en) | 2022-06-21 |
Family
ID=66231615
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811523661.7A Active CN109697285B (en) | 2018-12-13 | 2018-12-13 | Hierarchical BilSt Chinese electronic medical record disease coding and labeling method for enhancing semantic representation |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109697285B (en) |
Families Citing this family (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110427610A (en) * | 2019-06-25 | 2019-11-08 | 平安科技(深圳)有限公司 | Text analyzing method, apparatus, computer installation and computer storage medium |
CN110491499A (en) * | 2019-07-10 | 2019-11-22 | 厦门大学 | Clinical aid decision-making method and system towards mark electronic health record |
CN110491465B (en) * | 2019-08-20 | 2020-09-15 | 山东众阳健康科技集团有限公司 | Disease classification coding method, system, device and medium based on deep learning |
CN110633470A (en) * | 2019-09-17 | 2019-12-31 | 北京小米智能科技有限公司 | Named entity recognition method, device and storage medium |
CN112632269A (en) * | 2019-09-24 | 2021-04-09 | 北京国双科技有限公司 | Method and related device for training document classification model |
CN110837494B (en) * | 2019-10-12 | 2022-03-25 | 云知声智能科技股份有限公司 | Method and device for identifying unspecified diagnosis coding errors of medical record home page |
CN110781407B (en) * | 2019-10-21 | 2024-07-23 | 腾讯科技(深圳)有限公司 | User tag generation method, device and computer readable storage medium |
CN110866401A (en) * | 2019-11-18 | 2020-03-06 | 山东健康医疗大数据有限公司 | Chinese electronic medical record named entity identification method and system based on attention mechanism |
CN110867231A (en) * | 2019-11-18 | 2020-03-06 | 中山大学 | Disease prediction method, device, computer equipment and medium based on text classification |
CN110895580B (en) * | 2019-12-12 | 2020-07-07 | 山东众阳健康科技集团有限公司 | ICD operation and operation code automatic matching method based on deep learning |
CN113012774B (en) * | 2019-12-18 | 2024-08-30 | 医渡云(北京)技术有限公司 | Automatic medical record coding method and device, electronic equipment and storage medium |
CN111429204A (en) * | 2020-03-10 | 2020-07-17 | 携程计算机技术(上海)有限公司 | Hotel recommendation method, system, electronic equipment and storage medium |
CN111914539B (en) * | 2020-07-31 | 2024-09-10 | 长江航道测量中心 | Channel notification information extraction method and system based on BiLSTM-CRF model |
CN112183104B (en) * | 2020-08-26 | 2024-06-14 | 望海康信(北京)科技股份公司 | Code recommendation method, system, corresponding equipment and storage medium |
CN112052646B (en) * | 2020-08-27 | 2024-03-29 | 安徽聚戎科技信息咨询有限公司 | Text data labeling method |
CN112185564B (en) * | 2020-10-20 | 2022-09-06 | 福州数据技术研究院有限公司 | Ophthalmic disease prediction method based on structured electronic medical record and storage device |
CN112380863A (en) * | 2020-10-29 | 2021-02-19 | 国网天津市电力公司 | Sequence labeling method based on multi-head self-attention mechanism |
CN112259260B (en) * | 2020-11-18 | 2023-11-17 | 中国科学院自动化研究所 | Intelligent medical question-answering method, system and device based on intelligent wearable equipment |
CN112732915A (en) * | 2020-12-31 | 2021-04-30 | 平安科技(深圳)有限公司 | Emotion classification method and device, electronic equipment and storage medium |
CN112632911B (en) * | 2021-01-04 | 2022-05-13 | 福州大学 | Chinese character coding method based on character embedding |
CN113593709B (en) * | 2021-07-30 | 2022-09-30 | 江先汉 | Disease coding method, system, readable storage medium and device |
CN116955628B (en) * | 2023-08-08 | 2024-05-03 | 武汉市万睿数字运营有限公司 | Complaint event classification method, complaint event classification device, computer equipment and storage medium |
CN116884630B (en) * | 2023-09-06 | 2024-08-23 | 深圳达实旗云健康科技有限公司 | Method for improving disease automatic coding efficiency |
CN117438024B (en) * | 2023-12-15 | 2024-03-08 | 吉林大学 | Intelligent acquisition and analysis system and method for acute diagnosis patient sign data |
CN118518984B (en) * | 2024-07-24 | 2024-09-27 | 新疆西部明珠工程建设有限公司 | Intelligent fault positioning system and method for power transmission and distribution line |
Family Cites Families (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080288292A1 (en) * | 2007-05-15 | 2008-11-20 | Siemens Medical Solutions Usa, Inc. | System and Method for Large Scale Code Classification for Medical Patient Records |
WO2015084615A1 (en) * | 2013-12-03 | 2015-06-11 | 3M Innovative Properties Company | Constraint-based medical coding |
US10509889B2 (en) * | 2014-11-06 | 2019-12-17 | ezDI, Inc. | Data processing system and method for computer-assisted coding of natural language medical text |
EP3273373A1 (en) * | 2016-07-18 | 2018-01-24 | Fresenius Medical Care Deutschland GmbH | Drug dosing recommendation |
CN106484674B (en) * | 2016-09-20 | 2020-09-25 | 北京工业大学 | Chinese electronic medical record concept extraction method based on deep learning |
CN106844308B (en) * | 2017-01-20 | 2020-04-03 | 天津艾登科技有限公司 | Method for automatic disease code conversion using semantic recognition |
CN107644014A (en) * | 2017-09-25 | 2018-01-30 | 南京安链数据科技有限公司 | A kind of name entity recognition method based on two-way LSTM and CRF |
CN107731269B (en) * | 2017-10-25 | 2020-06-26 | 山东众阳软件有限公司 | Disease coding method and system based on original diagnosis data and medical record file data |
CN107977361B (en) * | 2017-12-06 | 2021-05-18 | 哈尔滨工业大学深圳研究生院 | Chinese clinical medical entity identification method based on deep semantic information representation |
CN108460013B (en) * | 2018-01-30 | 2021-08-20 | 大连理工大学 | Sequence labeling model and method based on fine-grained word representation model |
CN108536754A (en) * | 2018-03-14 | 2018-09-14 | 四川大学 | Electronic health record entity relation extraction method based on BLSTM and attention mechanism |
CN108628823B (en) * | 2018-03-14 | 2022-07-01 | 中山大学 | Named entity recognition method combining attention mechanism and multi-task collaborative training |
CN108628824A (en) * | 2018-04-08 | 2018-10-09 | 上海熙业信息科技有限公司 | A kind of entity recognition method based on Chinese electronic health record |
-
2018
- 2018-12-13 CN CN201811523661.7A patent/CN109697285B/en active Active
Also Published As
Publication number | Publication date |
---|---|
CN109697285A (en) | 2019-04-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109697285B (en) | Hierarchical BilSt Chinese electronic medical record disease coding and labeling method for enhancing semantic representation | |
CN109471895B (en) | Electronic medical record phenotype extraction and phenotype name normalization method and system | |
Cornegruta et al. | Modelling radiological language with bidirectional long short-term memory networks | |
CN110459282B (en) | Sequence labeling model training method, electronic medical record processing method and related device | |
Yu et al. | Automatic ICD code assignment of Chinese clinical notes based on multilayer attention BiRNN | |
US20210034813A1 (en) | Neural network model with evidence extraction | |
CN110705293A (en) | Electronic medical record text named entity recognition method based on pre-training language model | |
CN111966917A (en) | Event detection and summarization method based on pre-training language model | |
CN110442840B (en) | Sequence labeling network updating method, electronic medical record processing method and related device | |
CN106980608A (en) | A kind of Chinese electronic health record participle and name entity recognition method and system | |
Carchiolo et al. | Medical prescription classification: a NLP-based approach | |
CN110427486B (en) | Body condition text classification method, device and equipment | |
CN114023412B (en) | ICD code prediction method and system based on joint learning and denoising mechanism | |
CN111950283B (en) | Chinese word segmentation and named entity recognition system for large-scale medical text mining | |
Islam et al. | Challenges and future in deep learning for sentiment analysis: a comprehensive review and a proposed novel hybrid approach | |
CN112530584A (en) | Medical diagnosis assisting method and system | |
CN111145914B (en) | Method and device for determining text entity of lung cancer clinical disease seed bank | |
CN117787282B (en) | Doctor-patient text intelligent extraction method based on large language model | |
CN115859914A (en) | Diagnosis ICD automatic coding method and system based on medical history semantic understanding | |
CN111582506A (en) | Multi-label learning method based on global and local label relation | |
Hsu et al. | Multi-label classification of ICD coding using deep learning | |
Polignano et al. | A study of Machine Learning models for Clinical Coding of Medical Reports at CodiEsp 2020. | |
CN117194604B (en) | Intelligent medical patient inquiry corpus construction method | |
Zaghir et al. | Real-world patient trajectory prediction from clinical notes using artificial neural networks and UMLS-based extraction of concepts | |
CN116842168B (en) | Cross-domain problem processing method and device, electronic equipment and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |