Multifunctional calling extension set
Technical field
The present invention relates to Multifunctional calling extension set, belong to field of medical device.
Background technology
Calling extension set mainly coordinates nurse station main frame to use, and intercommunication communication between main frame.
Owing to the berth of hospital is more, the network bandwidth cannot meet the requirement of many extension sets simultaneous communications.Prior art does not has
It is provided with a solution and carries out screening, compression to reduce the demand to bandwidth for calling extension set being transmitted data.
Summary of the invention
For reducing the calling extension set demand to bandwidth, the present invention proposes a kind of Multifunctional calling extension set.
Technical solution of the present invention is as follows:
Multifunctional calling extension set, including main control chip, also includes the mike being each connected with main control chip respectively, raises
Sound device and receiver module, described main control chip connects also by level cache discrimination module, and described discrimination module passes through two
Level caching connection has sending module;
Described discrimination module, based on fpga chip, has been built in described fpga chip for differentiating invalid voice data slot
BP neutral net, described BP neutral net differentiate invalid voice data slot method step be:
(A) voice that mike is included by main control chip is converted into speech data, by below 50Hz in this speech data and
The frequency range of more than 1200Hz all filters, then the speech data after filtering is divided into speech-sound data sequence will in units of 3s
This speech-sound data sequence leaves in level cache;
Main control chip is successively read the element in speech-sound data sequence from level cache, is handled as follows respectively:
(A-1) the ensemble average decibel value remembering this element is x1, overall code check be x2;
(A-2) this element is carried out frequency-domain analysis, with 50Hz as starting point, calculate the rate of change often crossing 50Hz decibel value, note
Recording first rate of change frequency values more than 0.1dB/Hz is x3, first rate of change frequency values less than-0.1dB/Hz be x4;
If not finding qualified x3, then by x3It is set as 50Hz, if not finding qualified x4, then by x4It is set as 1200Hz;
(A-3) x is calculated3To x4The average decibel value of frequency range is x5;
(A-4) by x1、x2、x3、x4And x5Store to level cache as one group of input data;
(B) it is sequentially delivered in the BP neutral net of discrimination module sentence by each group input data in level cache
Not;
This BP neutral net is disposed with input layer, pretreatment layer, intermediate layer and output layer along input to outbound course;
Described input layer includes for inputting x1Input block one, be used for inputting x2Input block two, be used for inputting x3
Input block three, be used for inputting x4Input block four and be used for inputting x5Input block five;
Described pretreatment layer includes pretreatment unit one, pretreatment unit two, pretreatment unit three and pretreatment unit four;
Described intermediate layer includes temporary location one, temporary location two and temporary location three;
Described output layer includes output unit;
Described input layer, pretreatment layer, intermediate layer and output layer are respectively the 1st layer of BP neutral net, the 2nd layer, the 3rd layer
With the 4th layer;
Described input block one, input block two, input block three, input block four and input block five are respectively the 1st
Unit the 1st, Unit the 2nd, Unit the 3rd, Unit the 4th and Unit the 5th of layer;
Described pretreatment unit one, pretreatment unit two, pretreatment unit three and pretreatment unit four are respectively the 2nd layer
Unit the 1st, Unit the 2nd, Unit the 3rd and Unit the 4th;
Described temporary location one, temporary location two and temporary location three are respectively Unit the 1st, Unit the 2nd and the of the 3rd layer
Unit 3;
Described output unit is Unit the 1st of the 4th layer;
If the output valve of l layer i-th cell isBias term isActivation primitive is fi (l)(), the unit of l layer
Sum is n(l), the output valve of l layer jth unitWeights when being transferred to l+1 layer i-th cell are
Then for the 1st layer:
For the 2nd to 4 layer:
IfWithPerseverance is 0;
According to the data of input, BP neutral net judges whether this element is invalid voice data slot, if invalid language
This element is then replaced with blank sound data by sound data slot;
(C) speech-sound data sequence that replacement was processed by discrimination module is sent in L2 cache.
Further: the activation primitive of described pretreatment layer each unit is:
Further: the activation primitive of described intermediate layer and output layer each unit is:
fi (l)(x)=max (0, x-0.2).
Further, the training method of BP neutral net is: background noise frequency be 20Hz, 80Hz, 150Hz,
200Hz, 360Hz, 500Hz, 750Hz, 1000Hz, 1500Hz, 2000Hz and each all without recording 100 in the environment of voice respectively
The invalid sample voice data of bar duration 30s, and background noise frequency be 20Hz, 80Hz, 150Hz, 200Hz, 360Hz,
500Hz, 750Hz, 1000Hz, 1500Hz, 2000Hz and be each respectively provided with in the environment of voice and record 100 duration 30s respectively
Effective sample speech data;2000 sample speech datas are divided into speech-sound data sequence the most respectively in units of 3s,
Carry out 20000 elements of all speech-sound data sequences out of order being arranged to make up sample sequence, be successively read in sample sequence
Element: for each element, remembers that the ensemble average decibel value of this element is x1, overall code check be x2, this element is carried out frequency domain
Analyze, with 50Hz as starting point, calculate the rate of change often crossing 50Hz decibel value, record first rate of change frequency more than 0.1dB/Hz
Rate value is x3, first rate of change frequency values less than-0.1dB/Hz be x4If not finding qualified x3, then by x3Set
For 50Hz, if not finding qualified x4, then by x4It is set as 1200Hz, calculates x3To x4The average decibels of frequency range is x5,
By x1、x2、x3、x4And x5As one group of training sample input data;20000 groups of training sample input data are combined each pantogen
Corresponding invalidating expected outcome, to BP neural metwork training, keeps during training
WithPerseverance is 0.
Further: described main control chip is also associated with FM module.
Further: described main control chip is also associated with display screen.
Further: described main control chip is also associated with memory module.
Relative to prior art, the invention have the advantages that (1) present invention has discrimination module based on FPGA, energy
Whether enough utilize the neural network algorithm trained to determine according to the characteristic information of speech data fragment is that voice content is blank
Invalid voice data slot, and invalid voice data slot is replaced with blank sound data, have compressed transmitting audio data
Size, reduces the calling extension set demand to bandwidth;(2) this detection method utilizes neutral net to differentiate speech data,
There is the advantage that None-linear approximation ability is strong, judging efficiency is high and accuracy rate is high;(3) neutral net introduces pretreatment layer,
Owing to everyone voice frequency range is more concentrated, therefore part flexible strategy are carried out by pretreatment layer forcing to set, and will
First rate of change is more than the frequency values x of 0.1dB/Hz3With the frequency values x that first rate of change is less than-0.1dB/Hz4Both
Dependency is more apparent but that cannot be completely integrated characteristic information has carried out incomplete merging treatment, the most again by pretreatment
The result of layer exports in intermediate layer, it is ensured that x during follow-up calculating3And x4All the time possess certain dependency, thus carry
The high accuracy of judged result, also improves the efficiency of training simultaneously;(4) activation primitive of pretreatment layer sets and takes into full account
X3And x4Two incomplete merging treatment of characteristic information are in terms of computational efficiency, differential solve difficulty and dependency reservation
Requirement, have solve, training effectiveness is high and judgment accuracy is high advantage.
Accompanying drawing explanation
Fig. 1 is the structural representation of Multifunctional calling extension set proposed by the invention.
Fig. 2 is the structural representation of BP neutral net.
Detailed description of the invention
Describe technical scheme below in conjunction with the accompanying drawings in detail:
Such as Fig. 1, Multifunctional calling extension set, including main control chip, also include the wheat being each connected respectively with main control chip
Gram wind, speaker and receiver module, described main control chip connects also by level cache discrimination module, described discrimination module
Connected by L2 cache and have sending module;Described main control chip is also associated with FM module, LCDs and memory module, institute
State memory module can be built-in storage chip can also be to be accessed the flash disk device of this calling extension set by USB interface.
Patient can pass through this calling extension set and host communication, it is also possible to listens to FM by speaker and broadcasts, or appreciates storage
Music in module.LCDs may also display the relevant informations such as the name of patient, age.
Described discrimination module, based on fpga chip, has been built in described fpga chip for differentiating invalid voice data slot
BP neutral net, described BP neutral net differentiate invalid voice data slot method step be:
(A) voice that mike is included by main control chip is converted into speech data, it is contemplated that voice frequency range be 65 to
1100Hz, all filters the frequency range of below 50Hz and more than 1200Hz in this speech data, then the speech data after filtering
In units of 3s, it is divided into speech-sound data sequence and this speech-sound data sequence is left in level cache;
Main control chip is successively read the element in speech-sound data sequence from level cache, is handled as follows respectively:
(A-1) the ensemble average decibel value remembering this element is x1, overall code check be x2;
(A-2) this element is carried out frequency-domain analysis, with 50Hz as starting point, calculate the rate of change often crossing 50Hz decibel value, note
Recording first rate of change frequency values more than 0.1dB/Hz is x3, first rate of change frequency values less than-0.1dB/Hz be x4;
If not finding qualified x3, then by x3It is set as 50Hz, if not finding qualified x4, then by x4It is set as 1200Hz;
(A-3) x is calculated3To x4The average decibel value of frequency range is x5;
(A-4) by x1、x2、x3、x4And x5Store to level cache as one group of input data;
(B) it is sequentially delivered in the BP neutral net of discrimination module sentence by each group input data in level cache
Not;
This BP neutral net is disposed with input layer, pretreatment layer, intermediate layer and output layer along input to outbound course;
Described input layer includes for inputting x1Input block one, be used for inputting x2Input block two, be used for inputting x3
Input block three, be used for inputting x4Input block four and be used for inputting x5Input block five;
Described pretreatment layer includes pretreatment unit one, pretreatment unit two, pretreatment unit three and pretreatment unit four;
Described intermediate layer includes temporary location one, temporary location two and temporary location three;
Described output layer includes output unit;
Described input layer, pretreatment layer, intermediate layer and output layer are respectively the 1st layer of BP neutral net, the 2nd layer, the 3rd layer
With the 4th layer;
Described input block one, input block two, input block three, input block four and input block five are respectively the 1st
Unit the 1st, Unit the 2nd, Unit the 3rd, Unit the 4th and Unit the 5th of layer;
Described pretreatment unit one, pretreatment unit two, pretreatment unit three and pretreatment unit four are respectively the 2nd layer
Unit the 1st, Unit the 2nd, Unit the 3rd and Unit the 4th;
Described temporary location one, temporary location two and temporary location three are respectively Unit the 1st, Unit the 2nd and the of the 3rd layer
Unit 3;
Described output unit is Unit the 1st of the 4th layer;
If the output valve of l layer i-th cell isBias term isActivation primitive is fi (l)(), the unit of l layer
Sum is n(l), the output valve of l layer jth unitWeights when being transferred to l+1 layer i-th cell are
Then for the 1st layer:
For the 2nd to 4 layer:
IfWithPerseverance is 0, and this reason arranged is: each
The voice frequency range of people is typically more concentrated, i.e. x3With x4Difference will not be excessive, therefore in pretreatment layer carries out part flexible strategy
Force to set, and by x3And x4Both dependencys are more apparent but that cannot be completely integrated characteristic information has been carried out non-fully
The merging treatment of property, exports in intermediate layer by the result of pretreatment layer, it is ensured that x during follow-up calculating the most again3And x4
All the time possess certain dependency, thus improve the accuracy of judged result, also improve the efficiency of training simultaneously;
According to the data of input, BP neutral net judges whether this element is invalid voice data slot, if invalid language
This element is then replaced with blank sound data by sound data slot;
(C) speech-sound data sequence that replacement was processed by discrimination module is sent in L2 cache.
The activation primitive of described pretreatment layer each unit is:
The setting of this activation primitive has taken into full account x3And x4Effect is being calculated after two incomplete merging treatment of characteristic information
Rate, differential solve the requirement in terms of difficulty and dependency reservation, have solve, that training effectiveness is high and judgment accuracy is high is excellent
Point;
The activation primitive of described intermediate layer and output layer each unit is:
fi (l)(x)=max (0, x-0.2).
The training method of this BP neutral net is: background noise frequency be 20Hz, 80Hz, 150Hz, 200Hz, 360Hz,
500Hz, 750Hz, 1000Hz, 1500Hz, 2000Hz and each all without recording 100 duration 30s' in the environment of voice respectively
Invalid sample voice data, and background noise frequency be 20Hz, 80Hz, 150Hz, 200Hz, 360Hz, 500Hz, 750Hz,
1000Hz, 1500Hz, 2000Hz and be each respectively provided with the effective sample language recording 100 duration 30s in the environment of voice respectively
Sound data;During sampling, different environment should be selected as far as possible, and the personnel arranging sound ray mutually different participate in the record of voice
System;2000 sample speech datas are divided into speech-sound data sequence the most respectively in units of 3s, by all speech data sequences
20000 elements of row carry out out of order being arranged to make up sample sequence, the element being successively read in sample sequence: for every unitary
Element, remembers that the ensemble average decibel value of this element is x1, overall code check be x2, this element is carried out frequency-domain analysis, with 50Hz for rising
Point, calculates the rate of change often crossing 50Hz decibel value, and recording first rate of change frequency values more than 0.1dB/Hz is x3, first
The rate of change frequency values less than-0.1dB/Hz is x4If not finding qualified x3, then by x3It is set as 50Hz, if not finding
Qualified x4, then by x4It is set as 1200Hz, calculates x3To x4The average decibels of frequency range is x5, by x1、x2、x3、x4And x5
As one group of training sample input data;20000 groups of training sample input data are combined the effective/nothing corresponding to each pantogen
Effect expected outcome, to BP neural metwork training, keeps during trainingWithPerseverance is 0.
Patient, due to reasons such as health status, occurs that in communication the sound of stage rests often, thus the voice of transmission
Data volume is relatively big, occupies substantial portion of bandwidth.In order to the fragment that voice content in speech data is blank is removed, with
Reducing the demand to bandwidth, this programme is by using discrimination module, and it is invalid to determine according to the characteristic information of speech data fragment
Speech data fragment, is led to by L2 cache after then invalid voice data slot replacing with the blank sound data that volume is minimum again
Cross sending module to transmit to main frame, have compressed the size of transmitting audio data, reduce the calling extension set demand to bandwidth.