CN116759061B

CN116759061B - Physical examination project recommendation system based on personal demands

Info

Publication number: CN116759061B
Application number: CN202311035553.6A
Authority: CN
Inventors: 徐洪霞; 朱群仙
Original assignee: JIANYANG CITY PEOPLE'S HOSPITAL
Current assignee: JIANYANG CITY PEOPLE'S HOSPITAL
Priority date: 2023-08-17
Filing date: 2023-08-17
Publication date: 2023-10-27
Anticipated expiration: 2043-08-17
Also published as: CN116759061A

Abstract

The invention provides a physical examination project recommending system based on personal requirements, which belongs to the technical field of man-machine interaction systems.

Description

Physical examination project recommendation system based on personal demands

Technical Field

The invention relates to the technical field of human-computer interaction systems, in particular to a physical examination project recommendation system based on personal requirements.

Background

With the promotion of public health consciousness, the number of people attending physical examination is gradually increased, doctors and nurses are limited to hospitals, and due to different human body examination demands, the required physical examination items are different, if the required physical examination items are acquired through queuing or registration, the physical examination time is increased for the physical examination personnel, and the working intensity is increased for medical workers.

The existing physical examination guide and examination system is used for guiding physical examination staff to execute according to the procedure by pushing the physical examination procedure, realizing the standard procedure and reducing the physical examination time of the physical examination staff, but because the requirements of each person are different, the physical examination staff still need to guide by the medical care staff, thereby determining physical examination items and not reducing the workload of medical care work.

Disclosure of Invention

Aiming at the defects in the prior art, the physical examination item recommending system based on personal requirements solves the problem that the existing system for automatically acquiring physical examination items required by physical examination personnel is lacking.

In order to achieve the aim of the invention, the invention adopts the following technical scheme: a physical examination item recommendation system based on personal needs, comprising: the device comprises a voice acquisition denoising unit, a voice recognition unit, a keyword extraction unit, a keyword matching unit and a physical examination item pushing unit;

the voice acquisition denoising unit is used for acquiring voice signals of physical examination personnel and performing wavelet transformation denoising on the voice signals to obtain denoised voice signals; the voice recognition unit is used for recognizing the denoising voice signal to obtain text information; the keyword extraction unit is used for extracting physical examination keywords from the text information; the keyword matching unit is used for matching the physical examination keywords with the descriptions of each physical examination item and selecting the physical examination item successfully matched; the physical examination item pushing unit is used for pushing the physical examination items successfully matched with the corresponding physical examination processes.

Further, the voice acquisition denoising unit includes: a wavelet decomposition subunit, a wavelet coefficient selection subunit, and a reconstruction subunit;

the wavelet decomposition subunit is used for performing wavelet decomposition on the voice signal to obtain wavelet coefficients;

the wavelet coefficient selection subunit is used for updating the wavelet coefficient according to the threshold value to obtain an updated wavelet coefficient;

the reconstruction subunit is used for carrying out reconstruction processing on the updated wavelet coefficient to obtain a denoising voice signal.

Further, the expression of the updated wavelet coefficients is:

wherein ,for the updated->Layer wavelet coefficients, < >>For the%>Layer wavelet coefficients, < >>To update the coefficients.

The beneficial effects of the above further scheme are: the functions of the existing updated wavelet coefficients are segment functions, and the functions are discontinuous at the joint points of the segment functions, so that the precision of the updated wavelet coefficients is not high, therefore, the invention utilizes the continuous smoothing function with the function value between-1 and 1A new expression for updating the wavelet coefficients is constructed, so that the whole definition domain range is smooth, the wavelet coefficients can be precisely removed, and the denoising precision is improved.

Further, the update coefficientThe expression of (2) is:

wherein ,is->Threshold value of layer wavelet coefficient, < >>For the%>Layer wavelet coefficients.

The beneficial effects of the above further scheme are: updating coefficientsFollowing->Layer wavelet coefficients->And threshold->The adaptive change ensures that the function of the wavelet coefficient has good transition near the threshold value.

Further, the firstThreshold value of layer wavelet coefficient->The expression of (2) is:

wherein ,is a proportional coefficient->Is->The>Wavelet coefficient value,/">Is the number of wavelet coefficient values, +.>For the length of the speech signal, < >>For wavelet decomposition scale, +.>As a logarithmic function.

The beneficial effects of the above further scheme are: the invention utilizes the firstThe average value of wavelet coefficient values in the wavelet coefficients of the layers is used for estimating the threshold value, and the proportional coefficient is set for adjusting the threshold value.

Further, the voice recognition unit includes: the system comprises a convolution module, a residual error module, a first LSTM module, a second LSTM module, an attention module, a CNN network and a CTC classifier;

the input end of the convolution module is connected with the input end of the first LSTM module and is used as the input end of the voice recognition unit; the output end of the convolution module is connected with the input end of the residual error module; the input end of the second LSTM module is connected with the output end of the first LSTM module; the input end of the attention module is respectively connected with the output end of the residual error module and the output end of the second LSTM module, and the output end of the attention module is connected with the input end of the CNN network; the input end of the CTC classifier is connected with the output end of the CNN network, and the output end of the CTC classifier is used as the output end of the voice recognition unit.

The beneficial effects of the above further scheme are: the invention utilizes two paths to extract the characteristics of the denoising voice signal respectively, enriches the characteristic quantity, utilizes the LSTM module to have time memory, can better consider the history characteristics, the residual error module fuses the deep level and the shallow level sub-characteristics, the output of the LSTM module and the output of the residual error module are weighted and fused in the attention module, the processing is carried out according to the significance degree of each characteristic, the weight is applied to each characteristic in a self-adaption manner, the weighted and fused characteristics are input into the CNN network to carry out the deep extraction of the characteristics, and then the text information is output through the CTC classifier.

Further, the residual module includes: the system comprises a first convolution sub-module, a second convolution sub-module, a third convolution sub-module, an adder and a multiplier;

the input end of the first convolution sub-module is respectively connected with the input end of the third convolution sub-module and the first input end of the multiplier, and the output end of the first convolution sub-module is connected with the input end of the second convolution sub-module; the first input end of the adder is connected with the output end of the second convolution sub-module, the second input end of the adder is connected with the output end of the third convolution sub-module, and the output end of the adder is connected with the second input end of the multiplier; the output end of the multiplier is used as the output end of the residual error module.

The beneficial effects of the above further scheme are: according to the invention, the features processed by the two convolution sub-modules and the features processed by one convolution sub-module are added through the adder, so that the information quantity is improved, and then the multiplier is used for multiplying the input features, so that the problem of gradient disappearance can be solved, and the shallow features can be fused.

Further, the expression of the attention module is:

wherein ,for the output of the attention module, +.>For Concat splice operation, < >>Is->Weight of->For the output of the residual block,/>Is->Weight of->Is the output of the second LSTM module.

Further, the saidWeight of +.>The expression of (2) is:

wherein ,as an exponential function based on natural constants, < +.>To activate the function +.>For global pooling processing,/->Is the output of the residual error module;

the saidWeight of +.>The expression of (2) is:

wherein ,is the output of the second LSTM module.

The beneficial effects of the above further scheme are: in the invention respectively to and />Different weights are given, and splicing is carried out, so that feature fusion is realized, and the method is characterized in that +_> and />According to-> and />The self situation is calculated, and the self-adaption attention degree of the remarkable characteristics is improved.

Further, the loss function of the speech recognition unit is:

wherein ,for loss function->Is->Status function at sub-training>The true class in the denoised speech signal samples is equal to the class +.>When 1 is taken, if not, 0 is taken, +.>As an exponential function based on natural constants, < +.>Is->Predictive probability of speech recognition unit during secondary training, < >>For the current training times, +.>For a small scale adjacent to the training times, +.>For the total number of adjacent exercises>Is the number of categories.

The beneficial effects of the above further scheme are: in the present inventionWhen equal to 1>The closer to 0, the more the prediction differs from the tag, so the present invention uses the exponential function +.>To enhance this gap so that the loss function +.>The calculated loss value is large, the weight and bias in the voice recognition unit are reduced greatly, and the training time is shortened.

The technical scheme of the embodiment of the invention has at least the following advantages and beneficial effects: according to the invention, the voice signal of the physical examination personnel is collected through the voice collecting and denoising unit, the voice signal is denoised, the recognition precision is improved, the voice recognition unit is used for recognizing the denoised voice signal, the requirement of the physical examination personnel is obtained, the requirement keywords are extracted, the requirement keywords of the physical examination personnel are matched with the descriptions of each physical examination item, the successfully matched physical examination items and the corresponding physical examination process are pushed to the physical examination personnel, the workload of medical care work is reduced, and the queuing time of the physical examination personnel is further reduced.

Drawings

FIG. 1 is a system block diagram of a physical examination item recommendation system based on personal needs;

FIG. 2 is a block diagram of a speech recognition unit;

fig. 3 is a block diagram of the residual module.

Detailed Description

For the purpose of making the objects, technical solutions and advantages of the embodiments of the present invention more apparent, the technical solutions of the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention, and it is apparent that the described embodiments are some embodiments of the present invention, but not all embodiments of the present invention. The components of the embodiments of the present invention generally described and illustrated in the figures herein may be arranged and designed in a wide variety of different configurations.

As shown in fig. 1, a physical examination item recommendation system based on personal needs includes: the device comprises a voice acquisition denoising unit, a voice recognition unit, a keyword extraction unit, a keyword matching unit and a physical examination item pushing unit;

In this embodiment, the keyword extraction unit may store the keywords in the physical examination description in the memory, so as to match the keywords in the memory with the text information, the successfully matched keywords are the required keywords, and then match the keywords with the text description in the physical examination item, and when the keywords are successfully matched, the corresponding physical examination item is the required physical examination item.

The voice acquisition denoising unit comprises: a wavelet decomposition subunit, a wavelet coefficient selection subunit, and a reconstruction subunit;

The expression of the updated wavelet coefficients is:

The functions of the existing updated wavelet coefficients are segment functions, and the functions are discontinuous at the joint points of the segment functions, so that the precision of the updated wavelet coefficients is not high, therefore, the invention utilizes the continuous smoothing function with the function value between-1 and 1A new expression for updating the wavelet coefficients is constructed, so that the whole definition domain range is smooth, the wavelet coefficients can be precisely removed, and the denoising precision is improved.

The update coefficientThe expression of (2) is:

The invention updates the coefficientFollowing->Layer wavelet coefficients->And threshold->The adaptive change ensures that the function of the wavelet coefficient has good transition near the threshold value.

Said firstThreshold value of layer wavelet coefficient->The expression of (2) is:

The invention utilizes the firstThe average value of wavelet coefficient values in the wavelet coefficients of the layers is used for estimating the threshold value, and the proportional coefficient is set for adjusting the threshold value.

As shown in fig. 2, the voice recognition unit includes: the system comprises a convolution module, a residual error module, a first LSTM module, a second LSTM module, an attention module, a CNN network and a CTC classifier;

The invention utilizes two paths to extract the characteristics of the denoising voice signal respectively, enriches the characteristic quantity, utilizes the LSTM module to have time memory, can better consider the history characteristics, the residual error module fuses the deep level and the shallow level sub-characteristics, the output of the LSTM module and the output of the residual error module are weighted and fused in the attention module, the processing is carried out according to the significance degree of each characteristic, the weight is applied to each characteristic in a self-adaption manner, the weighted and fused characteristics are input into the CNN network to carry out the deep extraction of the characteristics, and then the text information is output through the CTC classifier.

As shown in fig. 3, the residual module includes: the system comprises a first convolution sub-module, a second convolution sub-module, a third convolution sub-module, an adder and a multiplier;

According to the invention, the features processed by the two convolution sub-modules and the features processed by one convolution sub-module are added through the adder, so that the information quantity is improved, and then the multiplier is used for multiplying the input features, so that the problem of gradient disappearance can be solved, and the shallow features can be fused.

In the present invention, the convolution sub-module and the convolution module each include: convolution layer, reLU layer and BN layer.

The expression of the attention module is:

The saidWeight of +.>The expression of (2) is:

the saidWeight of +.>The expression of (2) is:

wherein ,is the output of the second LSTM module.

In the invention respectively to and />Different weights are given, and splicing is carried out, so that feature fusion is realized, and the method is characterized in that +_> and />According to-> and />The self situation is calculated, and the self-adaption attention degree of the remarkable characteristics is improved.

The loss function of the voice recognition unit is as follows:

In the present inventionWhen equal to 1>The closer to 0, the more the prediction differs from the tag, so the present invention uses the exponential function +.>To enhance this gap so that the loss function +.>The calculated loss value is large, the weight and bias in the voice recognition unit are reduced greatly, and the training time is shortened.

In the present invention,for marking the current training times>For marking the number of adjacent exercises, selecting adjacent exercises>And (5) comprehensively evaluating the training condition according to the secondary condition.

The above is only a preferred embodiment of the present invention, and is not intended to limit the present invention, but various modifications and variations can be made to the present invention by those skilled in the art. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims

1. A physical examination item recommendation system based on personal needs, comprising: the device comprises a voice acquisition denoising unit, a voice recognition unit, a keyword extraction unit, a keyword matching unit and a physical examination item pushing unit;

the voice acquisition denoising unit is used for acquiring voice signals of physical examination personnel and performing wavelet transformation denoising on the voice signals to obtain denoised voice signals; the voice recognition unit is used for recognizing the denoising voice signal to obtain text information; the keyword extraction unit is used for extracting physical examination keywords from the text information; the keyword matching unit is used for matching the physical examination keywords with the descriptions of each physical examination item and selecting the physical examination item successfully matched; the physical examination item pushing unit is used for pushing the physical examination items successfully matched with the corresponding physical examination processes;

the voice recognition unit includes: the system comprises a convolution module, a residual error module, a first LSTM module, a second LSTM module, an attention module, a CNN network and a CTC classifier;

the input end of the convolution module is connected with the input end of the first LSTM module and is used as the input end of the voice recognition unit; the output end of the convolution module is connected with the input end of the residual error module; the input end of the second LSTM module is connected with the output end of the first LSTM module; the input end of the attention module is respectively connected with the output end of the residual error module and the output end of the second LSTM module, and the output end of the attention module is connected with the input end of the CNN network; the input end of the CTC classifier is connected with the output end of the CNN network, and the output end of the CTC classifier is used as the output end of the voice recognition unit;

the residual error module comprises: the system comprises a first convolution sub-module, a second convolution sub-module, a third convolution sub-module, an adder and a multiplier;

the input end of the first convolution sub-module is respectively connected with the input end of the third convolution sub-module and the first input end of the multiplier, and the output end of the first convolution sub-module is connected with the input end of the second convolution sub-module; the first input end of the adder is connected with the output end of the second convolution sub-module, the second input end of the adder is connected with the output end of the third convolution sub-module, and the output end of the adder is connected with the second input end of the multiplier; the output end of the multiplier is used as the output end of the residual error module;

the expression of the attention module is:

wherein ,for the output of the attention module, +.>For Concat splice operation, < >>Is->Weight of->For the output of the residual block,/>Is->Weight of->Is the output of the second LSTM module;

the saidWeight of +.>The expression of (2) is:

wherein ,is the output of the second LSTM module.

2. The personal demand based physical examination item recommendation system of claim 1, wherein the voice acquisition denoising unit comprises: a wavelet decomposition subunit, a wavelet coefficient selection subunit, and a reconstruction subunit;

3. The personal demand based physical examination item recommendation system of claim 2, wherein the expression of the updated wavelet coefficients is:

4. The personal demand based physical examination item recommendation system of claim 3, wherein the update coefficientsThe expression of (2) is:

5. The personal demand based physical examination item recommendation system of claim 4, wherein the firstThreshold value of layer wavelet coefficient->The expression of (2) is:

6. The personal demand based physical examination item recommendation system of claim 1 wherein said voice recognition unit has a loss function of:

wherein ,for loss function->Is->Status function at sub-training>The true class in the denoised speech signal samples is equal to the class +.>When 1 is taken, if not, 0 is taken, +.>As an exponential function based on natural constants, < +.>Is the firstPredictive probability of speech recognition unit during secondary training, < >>For the current training times, +.>For a small scale adjacent to the training times, +.>For the total number of adjacent exercises>Is the number of categories.