CN110225445A - Method and device for processing sound signals to achieve a three-dimensional sound field auditory effect - Google Patents
Method and device for processing sound signals to achieve a three-dimensional sound field auditory effect
- Publication number
- CN110225445A (application number CN201910428458.XA)
- Authority
- CN
- China
- Prior art keywords
- mobile terminal
- voice signal
- sound
- signal
- sound field
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
- H04S7/302—Electronic adaptation of stereophonic sound system to listener position or orientation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
- H04S7/305—Electronic adaptation of stereophonic audio signals to reverberation of the listening space
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Circuit For Audible Band Transducer (AREA)
- Stereophonic System (AREA)
Abstract
The invention discloses a method and a device for processing sound signals to achieve a three-dimensional sound field auditory effect. The method comprises: acquiring sound signals in multiple directions from a three-dimensional sound field; determining, according to the sound signals, the direction of a sound source relative to a mobile terminal; generating a spatial audio signal according to the sound signals and the direction of the sound source relative to the mobile terminal, the spatial audio signal being used to simulate the three-dimensional sound field; determining the number of microspeakers of the mobile terminal; decoding each channel of a multichannel program with an audio decoder; routing each channel to the output channels according to the downmix scale factors and the transformation matrix index provided by the encoder; and downmixing according to the determined number of microspeakers and sending the result to the output configuration of the corresponding channels. The method can improve human auditory perception of moving three-dimensional objects, thereby providing a more vivid three-dimensional environment and improving the quality of the simulated three-dimensional sound field.
Description
Technical field
The present invention relates to the technical field of audio signals, and in particular to a method and device for processing sound signals to achieve a three-dimensional sound field auditory effect.
Background art
A mobile terminal, also called a mobile communication terminal, is a computer device that can be used while on the move. In the broad sense it includes mobile phones, notebook computers, tablet computers, POS machines and even in-vehicle computers. In most cases, however, it refers to mobile phones, or to smartphones and tablet computers with multiple application functions. As networks and technology develop toward ever greater bandwidth, the mobile communication industry will move into a true mobile information era.
Earphones make mobile terminals more convenient to use, but they also cause harm. When a person wears earphones, the outer ear is almost completely blocked. High-volume audio enters the ear directly and is concentrated on the very thin eardrum. In addition, the diaphragm of an in-ear earphone sits close to the eardrum, so the sound waves propagate over a small, concentrated range and stimulate the eardrum and auditory nerve more strongly. Over time this easily causes tinnitus, insomnia, headache, a feeling of fullness and pain in the ear, and progressive hearing loss. A microspeaker that can reproduce the original three-dimensional sound field of the audio would greatly improve the user experience of mobile terminals.
There is as yet no internationally unified definition of a micro-electro-mechanical system (MEMS). In general, however, MEMS refers to a microsystem that can be batch-manufactured with microelectronic processes and that integrates micromechanisms, microsensors, microactuators, signal processing and control circuits, interfaces, communication and power supply in a single unit. The microspeaker is one specific application of MEMS technology.
A microspeaker is a micro electroacoustic transducer with a small, thin structure. Used under free-field conditions, it converts an audio electrical signal, by electroacoustic transduction, into audible sound with low distortion and sufficient sound pressure level, and it is widely used in mobile communication terminals.
Audiovisual media have entered the three-dimensional era. To achieve a better audiovisual experience, a three-dimensional sound field auditory effect synchronized with the 3D video content is needed for a truly immersive listening experience. In recent years NHK of Japan introduced the 22.2-channel system, which reproduces the original three-dimensional sound field with 24 loudspeakers. In 2011 MPEG began developing an international standard for three-dimensional audio, hoping to restore the three-dimensional sound field with fewer loudspeakers while achieving a certain coding efficiency, so that the technology can be brought to mobile terminals.
Summary of the invention
The object of the present invention is to provide a method for processing sound signals to achieve a three-dimensional sound field auditory effect, which can improve human auditory perception of moving three-dimensional objects, thereby providing a more vivid three-dimensional environment and improving the quality of the simulated three-dimensional sound field.
To solve the above technical problem, in a first aspect, an embodiment of the present invention provides a method for processing sound signals to achieve a three-dimensional sound field auditory effect, for use in a mobile terminal, comprising:
acquiring sound signals in multiple directions from a three-dimensional sound field;
determining, according to the sound signals, the direction of a sound source relative to the mobile terminal;
generating a spatial audio signal according to the sound signals and the direction of the sound source relative to the mobile terminal, the spatial audio signal being used to simulate the three-dimensional sound field;
determining the number of microspeakers of the mobile terminal;
decoding each channel of a multichannel program with an audio decoder;
routing each channel to the output channels according to the downmix scale factors and the transformation matrix index provided by the encoder;
downmixing according to the determined number of microspeakers and sending the result to the output configuration of the corresponding channels.
In one embodiment, determining the direction of the sound source relative to the mobile terminal according to the sound signals comprises:
obtaining the time difference of arrival between the sound signal received by the microphones on the mobile terminal in one direction and the sound signal in another direction, the microphones on the mobile terminal being used to receive sound signals in at least four directions;
determining the direction of the sound source relative to the mobile terminal according to the obtained time difference of arrival and the positions of the microphones on the mobile terminal.
In one embodiment, obtaining the time difference of arrival between the sound signal received by the microphones on the mobile terminal in one direction and the sound signal in another direction comprises:
determining, according to an audio localization algorithm, the time difference and intensity difference for the sound position.
In a second aspect, the present invention further provides a device for processing sound signals to achieve a three-dimensional sound field auditory effect, comprising:
an acquisition module, configured to acquire sound signals in multiple directions from a three-dimensional sound field;
a first determining module, configured to determine, according to the sound signals, the direction of a sound source relative to a mobile terminal;
a generation module, configured to generate a spatial audio signal according to the sound signals and the direction of the sound source relative to the mobile terminal, the spatial audio signal being used to simulate the three-dimensional sound field;
a second determining module, configured to determine the number of microspeakers of the mobile terminal;
a decoding module, configured to decode each channel of a multichannel program with an audio decoder;
a routing module, configured to route each channel to the output channels according to the downmix scale factors and the transformation matrix index provided by the encoder;
a sending module, configured to downmix according to the determined number of microspeakers and send the result to the output configuration of the corresponding channels.
In one embodiment, the first determining module comprises:
an acquisition submodule, configured to obtain the time difference of arrival between the sound signal received by the microphones on the mobile terminal in one direction and the sound signal in another direction, the microphones on the mobile terminal being used to receive sound signals in at least four directions;
a determining submodule, configured to determine the direction of the sound source relative to the mobile terminal according to the obtained time difference of arrival and the positions of the microphones on the mobile terminal.
In one embodiment, the acquisition submodule is specifically configured to determine, according to an audio localization algorithm, the time difference and intensity difference for the sound position.
The advantage of the present invention is that the method for processing sound signals to achieve a three-dimensional sound field auditory effect can improve human auditory perception of moving three-dimensional objects, thereby providing a more vivid three-dimensional environment and improving the quality of the simulated three-dimensional sound field.
Other features and advantages of the present invention will be set forth in the following description, and in part will become apparent from the description or be understood by practising the invention. The objects and other advantages of the invention can be realized and obtained by the structure particularly pointed out in the written description, the claims and the accompanying drawings.
The technical solution of the present invention is described in further detail below with reference to the drawings and embodiments.
Brief description of the drawings
The accompanying drawings provide a further understanding of the present invention and constitute a part of the specification. Together with the embodiments of the invention they serve to explain the invention and do not limit it. In the drawings:
Fig. 1 is a flowchart of the method for processing sound signals to achieve a three-dimensional sound field auditory effect provided by an embodiment of the present invention;
Fig. 2 is a schematic diagram of sound source propagation provided by an embodiment of the present invention;
Fig. 3 is a block diagram of the device for processing sound signals to achieve a three-dimensional sound field auditory effect provided by an embodiment of the present invention.
Detailed description of the embodiments
Exemplary embodiments of the present disclosure are described in more detail below with reference to the accompanying drawings. Although the drawings show exemplary embodiments of the disclosure, it should be understood that the disclosure may be implemented in various forms and should not be limited by the embodiments set forth here. Rather, these embodiments are provided so that the disclosure will be understood more thoroughly and its scope fully conveyed to those skilled in the art.
An embodiment of the present invention provides a method for processing sound signals to achieve a three-dimensional sound field auditory effect, for use in a mobile terminal. As shown in Fig. 1, the method comprises:
S11, acquiring sound signals in multiple directions from a three-dimensional sound field;
S12, determining, according to the sound signals, the direction of a sound source relative to the mobile terminal;
S13, generating a spatial audio signal according to the sound signals and the direction of the sound source relative to the mobile terminal, the spatial audio signal being used to simulate the three-dimensional sound field;
S14, determining the number of microspeakers of the mobile terminal;
S15, decoding each channel of a multichannel program with an audio decoder;
S16, routing each channel to the output channels according to the downmix scale factors and the transformation matrix index provided by the encoder;
S17, downmixing according to the determined number of microspeakers and sending the result to the output configuration of the corresponding channels.
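Steps S15 to S17 can be pictured as a matrix multiplication applied to the decoded channels. The following is only a minimal sketch under stated assumptions: the 5.1 channel order, the function names and the default scale factor of 0.7071 (about -3 dB) are illustrative choices, not values given in the source, where the scale factors would come from the encoder.

```python
import numpy as np

# Assumed channel order of the decoded 5.1 programme (hypothetical).
CHANNELS = ["L", "R", "C", "LFE", "Ls", "Rs"]

def downmix_matrix(num_speakers, scale=0.7071):
    """Build a (num_speakers x 6) routing matrix.

    `scale` stands in for the encoder-provided downmix scale factor;
    0.7071 is only a commonly used default, not a value from the source.
    """
    stereo = np.array([
        # L    R    C      LFE  Ls     Rs
        [1.0, 0.0, scale, 0.0, scale, 0.0],  # left output
        [0.0, 1.0, scale, 0.0, 0.0, scale],  # right output
    ])
    if num_speakers == 1:
        # Mono: average of the two stereo rows.
        return 0.5 * stereo.sum(axis=0, keepdims=True)
    if num_speakers == 2:
        return stereo
    if num_speakers == 6:
        # Enough speakers: route each channel straight through.
        return np.eye(6)
    raise ValueError("this sketch only covers 1, 2 or 6 speakers")

def downmix(decoded, num_speakers, scale=0.7071):
    """decoded: array (6, n_samples) -> array (num_speakers, n_samples)."""
    return downmix_matrix(num_speakers, scale) @ np.asarray(decoded, dtype=float)
```

A terminal with two microspeakers would call `downmix(decoded, 2)`; the LFE channel is simply dropped in this sketch, which is a design choice rather than something the source specifies.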
In this embodiment, the invention can be applied to a mobile terminal on which microphones are arranged. The microphones measure the three-dimensional sound field, acquire sound signals from it, and pass the sound signals to the processor in the mobile terminal for enhancement processing; the sound signals before and after enhancement can be passed to the memory in the mobile terminal for storage.
Specifically, the microphones arranged on the mobile terminal may be omnidirectional or may have a certain directivity. For example, MEMS (Micro-Electro-Mechanical System) microphones or ECM (Electret Condenser Microphone) microphones may be arranged on the mobile terminal.
In this embodiment, the microphones can be arranged on the mobile terminal in many ways, and neither the number nor the positions of the microphones are limited across these arrangements; this embodiment is described using four microphones and three microphones as examples. For example, four microphones can be placed at the four corners of the mobile terminal, or on its four sides. Alternatively, one microphone each can be placed on the bottom edge of the mobile terminal, beside the front earpiece, near the rear camera and near the rear bottom edge. As another example, only three microphones may be provided on the mobile terminal.
In step S12, the direction of the sound source relative to the mobile terminal is obtained from the acquired sound signals. As shown in Fig. 2, sound emitted by a sound source can travel directly to the left and right ears. Because the left ear is closer to the sound source, the sound reaches the left ear first and the right ear afterwards, and because of attenuation during propagation the sound heard by the left ear is louder than that heard by the right ear. This is the direct sound signal, and the brain receives the signals from both ears. Sound emitted by the source is also reflected by surrounding objects; these reflections are delayed relative to the direct signal and are quieter, and they are the indirect sound signals. The brain collects the direct signal and all the indirect signals, compares the signals picked up by the left and right ears, and by analysis and computation locates the sound source. Once the way the brain works is understood, the effect of a 3D sound source can be simulated by controlling the volume and delay of two speakers or earphones, so that the brain constructs a virtual 3D sound scene.
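The idea of simulating a source position by controlling volume and delay can be sketched as follows. This is an illustrative example only: the function name, the integer-sample delay and the single gain value are assumptions made here, not parameters described in the source.

```python
import numpy as np

def simulate_direction(mono, delay_samples, far_gain):
    """Turn a mono signal into a two-channel signal with a time and level difference.

    delay_samples > 0 places the source on the left (right ear hears it later),
    delay_samples < 0 places it on the right. far_gain (0 < far_gain <= 1)
    attenuates the ear farther from the source.
    Returns an array of shape (2, n + |delay_samples|): row 0 left, row 1 right.
    """
    mono = np.asarray(mono, dtype=float)
    d = abs(int(delay_samples))
    # Near ear: signal starts immediately, padded at the end.
    near = np.concatenate([mono, np.zeros(d)])
    # Far ear: signal arrives d samples later and quieter.
    far = far_gain * np.concatenate([np.zeros(d), mono])
    if delay_samples >= 0:
        return np.stack([near, far])
    return np.stack([far, near])
```

For instance, `simulate_direction(x, 20, 0.7)` would make a source appear to the listener's left; real systems would additionally model reflections and frequency-dependent effects.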
The mobile terminal uses its microphones to estimate the sound source emitting the sound and obtains the direction of the sound source relative to the mobile terminal. In this embodiment there are many possible methods of estimating the sound source, for example steerable beamforming based on maximum output power, localization based on time difference of arrival, or localization based on high-resolution spectral estimation.
Moreover, while estimating the sound source position, the mobile terminal can also enhance the received sound signals. For example, the mobile terminal can use techniques such as beamforming, spatial prediction and auditory scene analysis to obtain enhanced sound signals in each direction. Specific methods by which the mobile terminal enhances sound signals through beamforming may include: fixed beamforming techniques such as delay-and-sum beamforming and filter-and-sum beamforming; adaptive beamforming techniques such as adaptive beamforming based on the minimum variance distortionless response criterion, linearly constrained minimum variance beamforming and sidelobe cancellation algorithms; or differential beamforming. Specific methods by which the mobile terminal enhances sound signals through spatial prediction may include: presetting the spatial sound signals expected to be captured in certain directions, and then, with a pre-trained optimal filter bank and spatial prediction techniques, converting the sound signals in all directions received by the sound receivers on the mobile terminal into the output signals expected in the preset directions, so that the noise in the enhanced output sound signals is minimized and the prediction error approaches zero. In this embodiment, the auditory scene analysis technique may specifically be a blind source separation algorithm.
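Of the beamforming techniques listed above, delay-and-sum is the simplest to illustrate. The sketch below assumes integer sample delays that are already known for the desired look direction; the function name and interface are hypothetical and are not taken from the source.

```python
import numpy as np

def delay_and_sum(signals, delays):
    """Fixed delay-and-sum beamformer.

    signals: array (n_mics, n_samples).
    delays:  per-microphone integer delays in samples (0 <= delay <= n_samples)
             by which each channel lags the reference for the look direction.
    Each channel is advanced by its delay so that sound from the look
    direction adds coherently, then the channels are averaged.
    """
    signals = np.asarray(signals, dtype=float)
    n_mics, n = signals.shape
    out = np.zeros(n)
    for sig, d in zip(signals, delays):
        d = int(d)
        if not 0 <= d <= n:
            raise ValueError("delay must lie between 0 and n_samples")
        # Shift the channel earlier by d samples and accumulate.
        out[: n - d] += sig[d:]
    return out / n_mics
```

When the delays match the true arrival pattern, a pulse from the look direction keeps its full amplitude after averaging, while sound from other directions is spread out and attenuated.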
Enhancement yields sound signals with beam directivity in at least two directions. A sound signal with beam directivity in one direction, obtained by enhancing the sound signal received by a microphone, can for example be classified as a sound signal sent to the mobile terminal by sound sources in different directions such as front, rear, left or right. For example, after the mobile terminal enhances the received sound signals it generates enhanced sound signals in four directions, with beams pointing front-left, front-right, rear-left and rear-right respectively; alternatively, the enhanced sound signals in four directions may have beams pointing to the front, rear, left and right respectively. In this embodiment, sound signals with different directivities in multiple directions can also be combined, as required, into a sound signal in a specified direction, and the beam shape of the enhanced sound signal with beam directivity may be cardioid or another shape such as supercardioid.
In step S13, the spatial audio signal is obtained according to the direction of the sound source relative to the mobile terminal and the acquired sound signals.
Here the spatial audio signal is used to simulate the three-dimensional sound field. The three-dimensional sound field can be understood as the sound field within a certain range around the mobile terminal; a sound source can emit sound signals from any direction in the three-dimensional sound field, and these are received by the mobile terminal.
For example, the mobile terminal uses the direction of the sound source relative to the mobile terminal and the received sound signals in each direction to generate a spatial audio signal that simulates the sound field for a 5.1-channel playback system. The mobile terminal can map the enhanced sound signals in each direction to the sound signals in the six directions that together make up that spatial audio signal, and can use the direction of the sound source relative to the mobile terminal to further improve the separation between the six signals. For example, the mobile terminal can calculate, from the direction of the sound source relative to the mobile terminal, a gain adjustment parameter for each of the sound signals required by the 5.1-channel spatial audio signal, and use the gain adjustment parameters to adjust those sound signals. The spatial audio signal includes at least a left signal, a right signal, a center signal, a left rear surround signal and a right rear surround signal.
There are many possible correspondences between the sound signals in each direction received by the mobile terminal and the sound signals in the six directions required by the spatial audio signal that simulates the sound field for a 5.1-channel playback system. For example, the sound signals received by the mobile terminal are enhanced and output as sound signals in four directions: front-left, rear-left, front-right and rear-right. The front-left sound signal is mapped to the left signal required by the 5.1-channel spatial audio signal; the front-right sound signal is mapped to the right signal; the average of the front-left and front-right sound signals is mapped to the center signal; the rear-left sound signal is mapped to the left rear surround signal; the rear-right sound signal is mapped to the right rear surround signal; and the front-left, rear-left, front-right and rear-right sound signals are averaged and the average is low-pass filtered at 150 Hz to obtain the subwoofer (heavy bass) signal required by the 5.1-channel spatial audio signal.
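The four-beam mapping just described can be sketched in a few lines. This is a hedged illustration: the function names, the 48 kHz default sample rate and the first-order low-pass filter are assumptions made here, since the source only states that the average is low-pass filtered at 150 Hz.

```python
import numpy as np

def one_pole_lowpass(x, cutoff_hz, fs):
    """First-order IIR low-pass, a simple stand-in for the 150 Hz filter."""
    alpha = 1.0 - np.exp(-2.0 * np.pi * cutoff_hz / fs)
    y = np.empty(len(x))
    state = 0.0
    for i, sample in enumerate(x):
        # Move the filter state a fraction alpha toward the new sample.
        state += alpha * (sample - state)
        y[i] = state
    return y

def beams_to_51(fl, rl, fr, rr, fs=48000):
    """Map front-left, rear-left, front-right, rear-right beams to 5.1 channels."""
    fl, rl, fr, rr = (np.asarray(s, dtype=float) for s in (fl, rl, fr, rr))
    return {
        "L": fl,                   # left = front-left beam
        "R": fr,                   # right = front-right beam
        "C": 0.5 * (fl + fr),      # center = average of the two front beams
        "Ls": rl,                  # left rear surround = rear-left beam
        "Rs": rr,                  # right rear surround = rear-right beam
        # Subwoofer = low-passed average of all four beams.
        "LFE": one_pole_lowpass(0.25 * (fl + rl + fr + rr), 150.0, fs),
    }
```

A steeper filter would normally be preferred for the subwoofer channel; the one-pole filter is used only to keep the example self-contained.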
As another example, the sound signals received by the mobile terminal are enhanced and output as sound signals in four directions: front, rear, left and right. The average of the left and front sound signals is mapped to the left signal required by the spatial audio signal that simulates the sound field for a 5.1-channel playback system; the average of the right and front sound signals is mapped to the right signal; the front sound signal is mapped to the center signal; the average of the left and rear sound signals is mapped to the left rear surround signal; the average of the right and rear sound signals is mapped to the right rear surround signal; and the front, rear, left and right sound signals are averaged and the average is low-pass filtered at 150 Hz to obtain the subwoofer (heavy bass) signal required by the 5.1-channel spatial audio signal.
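Because this second mapping consists only of averages, it can be written as a fixed mixing matrix. The sketch below is illustrative: the row and column order and the names `MIX` and `map_beams` are assumptions, and the last row yields only the input to the 150 Hz low-pass filter, not the filtered subwoofer signal.

```python
import numpy as np

# Columns: front, rear, left, right beams.
# Rows: L, R, C, Ls, Rs, and the unfiltered subwoofer input.
MIX = np.array([
    [0.50, 0.00, 0.50, 0.00],  # L  = mean(left, front)
    [0.50, 0.00, 0.00, 0.50],  # R  = mean(right, front)
    [1.00, 0.00, 0.00, 0.00],  # C  = front
    [0.00, 0.50, 0.50, 0.00],  # Ls = mean(left, rear)
    [0.00, 0.50, 0.00, 0.50],  # Rs = mean(right, rear)
    [0.25, 0.25, 0.25, 0.25],  # mean of all four, still to be low-pass filtered
])

def map_beams(beams):
    """beams: array (4, n_samples) ordered front, rear, left, right."""
    return MIX @ np.asarray(beams, dtype=float)
```

Expressing the mapping as a matrix makes it easy to swap in a different correspondence, which fits the statement that many mappings are possible.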
The method for processing sound signals provided by the embodiment of the present invention can acquire sound signals from the three-dimensional sound field of the mobile terminal, obtain the direction of each sound source relative to the mobile terminal, and then use the direction of the sound source relative to the mobile terminal and the sound signals to obtain a spatial audio signal for simulating the three-dimensional sound field.
The solution provided by the invention can use the components of the mobile terminal itself to capture and process the sound signals used to simulate the three-dimensional sound field, and analyzes the direction, relative to the mobile terminal, of the sound sources received in each direction; the effect of the three-dimensional sound field is then enhanced using these directions. Compared with prior-art schemes that simulate the three-dimensional sound field using beams alone, the invention obtains the direction of the sound sources in each direction relative to the mobile terminal and uses this direction information to apply further gain adjustment to the beamforming. This mitigates the phenomenon in which left-right differentiation in the simulated three-dimensional sound field is more pronounced than front-back differentiation, thereby improving the quality of the simulated three-dimensional sound field.
In this embodiment, the mobile terminal can obtain the direction of the sound source relative to the mobile terminal in several ways. For example, positioning technology can be used to obtain the spatial coordinates of the mobile terminal acting as the sound source and of the mobile terminal itself, and the direction of the sound source relative to the mobile terminal can be determined from these coordinates. However, positioning consumes network bandwidth and introduces a certain delay, and in this embodiment the mobile terminal needs to obtain the directions of sound sources in multiple directions relative to itself; the direction of the sound source relative to the mobile terminal can therefore be obtained by localization based on time difference of arrival.
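For a single pair of microphones, the relation between time difference of arrival and direction can be sketched with the usual far-field approximation. The formula, the function name and the speed-of-sound constant are assumptions introduced for illustration; the source does not state how the direction is computed from the time difference and microphone positions.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, approximate value in air at about 20 degrees C

def arrival_angle(tdoa_s, mic_spacing_m, c=SPEED_OF_SOUND):
    """Far-field angle of arrival in radians for one microphone pair.

    The angle is measured from the broadside direction (perpendicular to the
    line joining the microphones): sin(angle) = c * tdoa / spacing.
    """
    ratio = c * tdoa_s / mic_spacing_m
    # Clamp so that measurement noise cannot push the ratio outside [-1, 1].
    ratio = max(-1.0, min(1.0, ratio))
    return math.asin(ratio)
```

One pair cannot distinguish front from back, which is one reason several microphones at known positions on the terminal are needed to resolve the direction fully.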
To simulate the interaction between real sound waves and the ears, Resonance Audio technology uses audio localization algorithms based on head-related transfer functions (HRTFs). HRTFs capture the time differences and intensity differences that determine the perceived position of a sound, as well as the spectral effects that determine it. The algorithm calculates the various effects that the head and ears have on sound waves as they are transmitted and reflected, simulating how the human nervous system judges the position of a sound source, in particular its vertical height (elevation). Audio processed with HRTFs and played through headphones creates the illusion in the user's brain that the sound has a specific position in the virtual world around them, which achieves the goal of simulated localization of the sound source. In addition, Resonance Audio simulates not only the interaction of sound waves with the ears but also the interaction of sound waves with their surrounding environment.
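To make the time-difference cue concrete, the sketch below uses the Woodworth spherical-head approximation, ITD = (a / c)(θ + sin θ). This formula is not part of the patent and is swapped in here as a crude stand-in for measured HRTFs; it models only the interaural time difference, not the intensity or spectral cues. The head radius, the speed of sound, and the function names are assumptions chosen for illustration.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed value for air at room temperature
HEAD_RADIUS = 0.0875    # m, assumed average head radius


def woodworth_itd(azimuth_deg):
    """Interaural time difference in seconds for a distant source.

    Azimuth 0 is straight ahead and +90 is directly to the right.
    The approximation is only used here for |azimuth| <= 90 degrees.
    """
    if abs(azimuth_deg) > 90:
        raise ValueError("this sketch only covers azimuths from -90 to +90 degrees")
    theta = math.radians(azimuth_deg)
    return (HEAD_RADIUS / SPEED_OF_SOUND) * (theta + math.sin(theta))


def apply_itd(mono, azimuth_deg, sample_rate):
    """Return (left, right) sample lists with the far ear delayed by the ITD.

    The delay is rounded to a whole number of samples, which is a simplification.
    """
    delay = round(abs(woodworth_itd(azimuth_deg)) * sample_rate)
    delayed = [0.0] * delay + list(mono)
    direct = list(mono) + [0.0] * delay
    if azimuth_deg >= 0:
        # Source on the right: the left ear receives the sound later.
        return delayed, direct
    return direct, delayed


left, right = apply_itd([1.0, 0.0], azimuth_deg=90, sample_rate=48000)
print(left.index(1.0), right.index(1.0))
```

A full HRTF renderer would instead filter the signal with a measured impulse response for each ear, which also carries the level and spectral differences that this sketch omits.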
Sound source localization methods based on microphone arrays fall roughly into three classes: steerable beamforming based on maximum output power, high-resolution spectral estimation, and sound source localization based on time-delay estimation (TDE).
The core of TDE-based algorithms is accurate estimation of the propagation delay, which is generally obtained by cross-correlating the signals received by different microphones. The sound source position can then be obtained by simple delay-and-sum, by geometric calculation, or by using the cross-correlation results directly in a steered response power search. Such algorithms are relatively simple to implement, have a low computational cost, and are suitable for real-time processing, so they are the most widely used in practice.
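The cross-correlation step can be illustrated with a minimal sketch. It uses a plain time-domain cross-correlation over a bounded lag range, which is one simple way to estimate the delay and not necessarily the variant intended by the patent. The function name `estimate_delay`, the test signal, and the 16 kHz sample rate are assumptions.

```python
import random


def estimate_delay(sig_a, sig_b, max_lag):
    """Return the lag in samples at which sig_b best matches sig_a.

    A positive result means sig_b is a delayed copy of sig_a.
    """
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i, a in enumerate(sig_a):
            j = i + lag
            if 0 <= j < len(sig_b):
                score += a * sig_b[j]
        # Keep the lag with the largest correlation value.
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


# Synthetic example: the second microphone hears the same noise 5 samples later.
rng = random.Random(0)
reference = [rng.uniform(-1.0, 1.0) for _ in range(200)]
delayed = [0.0] * 5 + reference[:-5]

SAMPLE_RATE = 16000  # Hz, assumed
lag = estimate_delay(reference, delayed, max_lag=20)
print(lag, lag / SAMPLE_RATE)
```

Dividing the lag by the sample rate gives the time difference of arrival in seconds. Real recordings are noisy and reverberant, so practical systems often use a weighted frequency-domain correlation instead of this direct form.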
In addition, the audio decoder can perform downmixing, which is needed when the number of output channels is smaller than the number of coded channels. In this way, a complete multichannel program can be reproduced over fewer channels. The decoder decodes each channel and routes these channels to the output channels using the downmix scale factors and the transformation matrix index provided by the encoder (the scale factors set the relative level of each channel; the corresponding inverse transformation matrix is obtained from the transformation matrix index and is used to restore the corresponding quantized channel units). The downmix can be sent to an output configuration of 3 to 8 channels, which is suitable for mobile terminals.
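The routing step can be pictured as a matrix of scale factors applied to each decoded frame. The sketch below assumes a five-channel program (L, R, C, Ls, Rs) downmixed to three output channels, with a surround gain of 0.707; the channel layout, the coefficients, and the function name `downmix` are hypothetical, and the inverse-transformation step that restores the quantized channel units is not modeled.

```python
def downmix(frames, matrix):
    """Mix multichannel frames down to fewer output channels.

    frames: list of frames, each a list with one sample per input channel.
    matrix: one row per output channel, one scale factor per input channel.
    """
    mixed = []
    for frame in frames:
        if any(len(row) != len(frame) for row in matrix):
            raise ValueError("each matrix row needs one scale factor per input channel")
        mixed.append([sum(gain * sample for gain, sample in zip(row, frame))
                      for row in matrix])
    return mixed


# Hypothetical scale factors: input order is L, R, C, Ls, Rs.
FIVE_TO_THREE = [
    [1.0, 0.0, 0.0, 0.707, 0.0],  # output L = L + 0.707 * Ls
    [0.0, 1.0, 0.0, 0.0, 0.707],  # output R = R + 0.707 * Rs
    [0.0, 0.0, 1.0, 0.0, 0.0],    # output C = C
]

frames = [[1.0, 0.5, 0.25, 0.2, 0.4]]
print(downmix(frames, FIVE_TO_THREE))
```

In a real decoder the scale factors would come from the encoded stream rather than from a hard-coded table, and the number of matrix rows would follow the number of microspeakers on the terminal.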
The number of microspeakers can be determined according to the size of the mobile terminal. The audio decoder decodes each channel of the multichannel program and routes these channels to the output channels using the downmix scale factors and the transformation matrix index provided by the encoder. The program is then downmixed according to the determined number of microspeakers and sent to the output configuration of the corresponding channels.
In a second aspect, the present invention also provides a device for processing sound signals to achieve a three-dimensional sound field auditory effect. Referring to Fig. 3, the device comprises:
an obtaining module 31, configured to obtain sound signals in multiple directions from a three-dimensional sound field;
a first determining module 32, configured to determine the direction of a sound source relative to the mobile terminal according to the sound signals;
a generation module 33, configured to generate a spatial audio signal according to the sound signals and the direction of the sound source relative to the mobile terminal, the spatial audio signal being used to simulate the three-dimensional sound field;
a second determining module 34, configured to determine the number of microspeakers that the mobile terminal has;
a decoder module 35, configured to decode each channel of a multichannel program by means of an audio decoder;
a routing module 36, configured to route each channel to the output channels according to the downmix scale factors and the transformation matrix index provided by the encoder;
a sending module 37, configured to downmix according to the determined number of microspeakers and send the result to the output configuration of the corresponding channels.
In one embodiment, the first determining module 32 comprises:
an acquisition submodule 321, configured to obtain the time difference of arrival between the sound signal received in one direction and the sound signal received in another direction by the microphones on the mobile terminal, the microphones on the mobile terminal being used to receive sound signals in at least four directions;
a determining submodule 322, configured to determine the direction of the sound source relative to the mobile terminal according to the obtained time difference of arrival and the positions of the microphones on the mobile terminal.
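One way the determining submodule could combine the time differences of arrival with the microphone positions is a far-field least-squares fit, sketched below. The patent does not specify the computation, so this is an assumption: the source is treated as distant (plane wave), the microphones are not all coplanar, each time difference is measured relative to microphone 0, and the names `direction_from_tdoa` and `_det3` are hypothetical.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s, assumed


def _det3(m):
    """Determinant of a 3x3 matrix given as nested lists."""
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))


def direction_from_tdoa(mic_positions, tdoas):
    """Return a unit vector pointing from the terminal toward the source.

    mic_positions: at least four (x, y, z) tuples in metres, not all coplanar.
    tdoas: arrival time at microphone i minus arrival time at microphone 0,
           in seconds, for i = 1 .. N-1.
    Plane-wave model: (p_i - p_0) . u = -c * tdoa_i.
    """
    p0 = mic_positions[0]
    rows = [[p[k] - p0[k] for k in range(3)] for p in mic_positions[1:]]
    rhs = [-SPEED_OF_SOUND * t for t in tdoas]
    # Normal equations (A^T A) u = A^T b, solved with Cramer's rule.
    ata = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    atb = [sum(r[i] * b for r, b in zip(rows, rhs)) for i in range(3)]
    det = _det3(ata)
    if abs(det) < 1e-18:
        raise ValueError("microphone geometry does not determine a 3-D direction")
    u = []
    for k in range(3):
        replaced = [[atb[i] if j == k else ata[i][j] for j in range(3)]
                    for i in range(3)]
        u.append(_det3(replaced) / det)
    norm = math.sqrt(sum(c * c for c in u))
    if norm == 0.0:
        raise ValueError("all time differences are zero; direction is undefined")
    return [c / norm for c in u]


# Hypothetical layout: four microphones, three of them 10 cm from the first.
mics = [(0.0, 0.0, 0.0), (0.1, 0.0, 0.0), (0.0, 0.1, 0.0), (0.0, 0.0, 0.1)]
true_u = (0.6, 0.0, 0.8)
tdoas = [-sum((m[k] - mics[0][k]) * true_u[k] for k in range(3)) / SPEED_OF_SOUND
         for m in mics[1:]]
print(direction_from_tdoa(mics, tdoas))
```

Four non-coplanar microphones give exactly three independent equations for the three components of the direction, which is consistent with the requirement that the microphones receive sound signals in at least four directions.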
In one embodiment, the acquisition submodule 321 is specifically configured to determine the time difference and intensity difference of the sound position according to an audio localization algorithm.
Finally, it should be noted that the above embodiments are intended only to illustrate the technical solution of the present invention and not to limit it. Although the present invention has been described in detail with reference to the embodiments, those skilled in the art should understand that modifications or equivalent replacements of the technical solution of the present invention that do not depart from its spirit and scope shall all be covered by the scope of the claims of the present invention.
Claims (6)
1. A method for processing sound signals to achieve a three-dimensional sound field auditory effect, applied to a mobile terminal, characterized by comprising:
obtaining sound signals in multiple directions from a three-dimensional sound field;
determining the direction of a sound source relative to the mobile terminal according to the sound signals;
generating a spatial audio signal according to the sound signals and the direction of the sound source relative to the mobile terminal, the spatial audio signal being used to simulate the three-dimensional sound field;
determining the number of microspeakers that the mobile terminal has;
decoding each channel of a multichannel program by means of an audio decoder;
routing each channel to the output channels according to the downmix scale factors and the transformation matrix index provided by the encoder;
downmixing according to the determined number of microspeakers and sending the result to the output configuration of the corresponding channels.
2. The method according to claim 1, characterized in that determining the direction of the sound source relative to the mobile terminal according to the sound signals comprises:
obtaining the time difference of arrival between the sound signal received in one direction and the sound signal received in another direction by the microphones on the mobile terminal, the microphones on the mobile terminal being used to receive sound signals in at least four directions;
determining the direction of the sound source relative to the mobile terminal according to the obtained time difference of arrival and the positions of the microphones on the mobile terminal.
3. The method according to claim 2, characterized in that obtaining the time difference of arrival between the sound signal received in one direction and the sound signal received in another direction by the microphones on the mobile terminal comprises:
determining the time difference and intensity difference of the sound position according to an audio localization algorithm.
4. A device for processing sound signals to achieve a three-dimensional sound field auditory effect, characterized by comprising:
an obtaining module, configured to obtain sound signals in multiple directions from a three-dimensional sound field;
a first determining module, configured to determine the direction of a sound source relative to a mobile terminal according to the sound signals;
a generation module, configured to generate a spatial audio signal according to the sound signals and the direction of the sound source relative to the mobile terminal, the spatial audio signal being used to simulate the three-dimensional sound field;
a second determining module, configured to determine the number of microspeakers that the mobile terminal has;
a decoder module, configured to decode each channel of a multichannel program by means of an audio decoder;
a routing module, configured to route each channel to the output channels according to the downmix scale factors and the transformation matrix index provided by the encoder;
a sending module, configured to downmix according to the determined number of microspeakers and send the result to the output configuration of the corresponding channels.
5. The device according to claim 4, characterized in that the first determining module comprises:
an acquisition submodule, configured to obtain the time difference of arrival between the sound signal received in one direction and the sound signal received in another direction by the microphones on the mobile terminal, the microphones on the mobile terminal being used to receive sound signals in at least four directions;
a determining submodule, configured to determine the direction of the sound source relative to the mobile terminal according to the obtained time difference of arrival and the positions of the microphones on the mobile terminal.
6. The device according to claim 5, characterized in that the acquisition submodule is specifically configured to determine the time difference and intensity difference of the sound position according to an audio localization algorithm.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910428458.XA CN110225445A (en) | 2019-05-22 | 2019-05-22 | A kind of processing voice signal realizes the method and device of three-dimensional sound field auditory effect |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110225445A (en) | 2019-09-10 |
Family
ID=67821618
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910428458.XA Pending CN110225445A (en) | 2019-05-22 | 2019-05-22 | A kind of processing voice signal realizes the method and device of three-dimensional sound field auditory effect |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110225445A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113095258A (en) * | 2021-04-20 | 2021-07-09 | 深圳力维智联技术有限公司 | Directional signal extraction method, system, device and storage medium |
CN113281705A (en) * | 2021-04-28 | 2021-08-20 | 鹦鹉鱼(苏州)智能科技有限公司 | Microphone array device and mobile sound source audibility method based on same |
CN114286276A (en) * | 2021-12-22 | 2022-04-05 | 北京罗克维尔斯科技有限公司 | Vehicle sound field control method and device, electronic equipment and electric vehicle |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105227740A (en) * | 2014-06-23 | 2016-01-06 | 张军 | A kind of method realizing mobile terminal three-dimensional sound field auditory effect |
CN105451151A (en) * | 2014-08-29 | 2016-03-30 | 华为技术有限公司 | Method and apparatus for processing sound signal |
CN108206983A (en) * | 2016-12-16 | 2018-06-26 | 南京青衿信息科技有限公司 | The encoder and its method of the three-dimensional acoustical signal of compatible existing audio-visual system |
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
RJ01 | Rejection of invention patent application after publication | Application publication date: 20190910 |