CN110225445A

CN110225445A - Method and device for processing sound signals to achieve a three-dimensional sound field auditory effect

Info

Publication number: CN110225445A
Application number: CN201910428458.XA
Authority: CN
Inventors: 张军
Original assignee: Shanghai DC Science Co Ltd
Current assignee: Shanghai DC Science Co Ltd
Priority date: 2019-05-22
Filing date: 2019-05-22
Publication date: 2019-09-10

Abstract

The invention discloses a method and device for processing sound signals to achieve a three-dimensional sound field auditory effect. The method comprises: acquiring sound signals in multiple directions from a three-dimensional sound field; determining, from the sound signals, the direction of a sound source relative to a mobile terminal; generating a spatial audio signal, which is used to simulate the three-dimensional sound field, from the sound signals and the direction of the sound source relative to the mobile terminal; determining the number of micro-speakers in the mobile terminal; decoding each channel of a multichannel program with an audio decoder; routing each channel to the output channels according to the downmix scale factors and transformation matrix index provided by the encoder; and sending the downmix to the output configuration with the corresponding number of channels, according to the determined number of micro-speakers. The method can improve the listener's auditory perception of moving three-dimensional objects, thereby providing a more vivid three-dimensional environment and improving the quality of the simulated three-dimensional sound field.

Description

Method and device for processing sound signals to achieve a three-dimensional sound field auditory effect

Technical field

The present invention relates to the technical field of audio signals, and in particular to a method and device for processing sound signals to achieve a three-dimensional sound field auditory effect.

Background technique

A mobile terminal, also called a mobile communication terminal, is a computing device that can be used while on the move. Broadly speaking, the term covers mobile phones, notebooks, tablet computers, POS machines, and even in-vehicle computers. In most cases, however, it refers to mobile phones, or to smartphones and tablet computers with multiple application functions. As networks and technology develop toward ever greater bandwidth, the mobile communications industry will move into a true mobile information era.

Earphones make mobile terminals more convenient to use, but they also cause harm. When a person wears earphones, the outer ear is almost completely blocked. High-volume audio enters the ear directly and is concentrated on the very thin eardrum. Moreover, the diaphragm of an in-ear earphone sits close to the eardrum, so the sound waves propagate over a small, concentrated range and stimulate the eardrum and auditory nerve more strongly. Over time this easily causes tinnitus, insomnia, headaches, a feeling of fullness and pain in the ear, and progressive hearing loss. A micro-speaker that could reproduce the original three-dimensional sound field of the audio would greatly improve the user experience of mobile terminals.

There is as yet no unified international definition of a micro-electro-mechanical system (MEMS). In general, however, MEMS refers to a microsystem that can be batch-manufactured using microelectronic processes and that integrates micro-mechanisms, microsensors, micro-actuators, signal processing and control circuits, interfaces, communication, and power supply. The micro-speaker is one specific application of MEMS technology.

A micro-speaker is a small, thin electroacoustic transducer. Used under free-field conditions, it converts an audio electrical signal, by electroacoustic transduction, into audible sound with low distortion and sufficient sound pressure level, and it is widely used in mobile communication terminals.

Audiovisual media have entered the three-dimensional era. A truly immersive listening experience requires a three-dimensional sound field auditory effect synchronized with the three-dimensional video content. In recent years Japan's NHK has introduced the 22.2-channel system, which reproduces the original three-dimensional sound field through 24 loudspeakers. In 2011 MPEG began work on an international standard for three-dimensional audio, with the aim of restoring the three-dimensional sound field with relatively few loudspeakers while achieving a certain coding efficiency, so that the technology can be brought to mobile terminals.

Summary of the invention

The object of the present invention is to provide a method for processing sound signals to achieve a three-dimensional sound field auditory effect, which can improve the listener's auditory perception of moving three-dimensional objects, thereby providing a more vivid three-dimensional environment and improving the quality of the simulated three-dimensional sound field.

To solve the above technical problem, in a first aspect, an embodiment of the present invention provides a method for processing sound signals to achieve a three-dimensional sound field auditory effect, for use in a mobile terminal, comprising:

Acquiring sound signals in multiple directions from a three-dimensional sound field;

Determining, according to the sound signals, the direction of a sound source relative to the mobile terminal;

Generating a spatial audio signal according to the sound signals and the direction of the sound source relative to the mobile terminal, the spatial audio signal being used to simulate the three-dimensional sound field;

Determining the number of micro-speakers in the mobile terminal;

Decoding each channel of a multichannel program with an audio decoder;

Routing each channel to the output channels according to the downmix scale factors and transformation matrix index provided by the encoder;

Sending the downmix to the output configuration with the corresponding number of channels, according to the determined number of micro-speakers.

In one embodiment, determining the direction of the sound source relative to the mobile terminal according to the sound signals comprises:

Obtaining the time difference of arrival between the sound signal that the microphones on the mobile terminal receive in one direction and the sound signal they receive in another direction, the microphones on the mobile terminal being used to receive sound signals in at least four directions;

Determining the direction of the sound source relative to the mobile terminal according to the obtained time difference of arrival and the positions of the microphones on the mobile terminal.

In one embodiment, obtaining the time difference of arrival between the sound signal that the microphones on the mobile terminal receive in one direction and the sound signal they receive in another direction comprises:

Determining the time difference and intensity difference for the sound position according to an audio localization algorithm.

In a second aspect, the present invention also provides a device for processing sound signals to achieve a three-dimensional sound field auditory effect, comprising:

An acquisition module, for acquiring sound signals in multiple directions from a three-dimensional sound field;

A first determining module, for determining, according to the sound signals, the direction of a sound source relative to the mobile terminal;

A generation module, for generating a spatial audio signal according to the sound signals and the direction of the sound source relative to the mobile terminal, the spatial audio signal being used to simulate the three-dimensional sound field;

A second determining module, for determining the number of micro-speakers in the mobile terminal;

A decoder module, for decoding each channel of a multichannel program with an audio decoder;

A routing module, for routing each channel to the output channels according to the downmix scale factors and transformation matrix index provided by the encoder;

A sending module, for sending the downmix to the output configuration with the corresponding number of channels, according to the determined number of micro-speakers.

In one embodiment, the first determining module comprises:

An acquisition submodule, for obtaining the time difference of arrival between the sound signal that the microphones on the mobile terminal receive in one direction and the sound signal they receive in another direction, the microphones on the mobile terminal being used to receive sound signals in at least four directions;

A determining submodule, for determining the direction of the sound source relative to the mobile terminal according to the obtained time difference of arrival and the positions of the microphones on the mobile terminal.

In one embodiment, the acquisition submodule is specifically used for determining the time difference and intensity difference for the sound position according to an audio localization algorithm.

The advantage of the present invention is that the method for processing sound signals to achieve a three-dimensional sound field auditory effect can improve the listener's auditory perception of moving three-dimensional objects, thereby providing a more vivid three-dimensional environment and improving the quality of the simulated three-dimensional sound field.

Other features and advantages of the present invention will be set forth in the description that follows, and in part will become apparent from the description or be understood by practicing the invention. The objectives and other advantages of the invention can be realized and obtained through the structures particularly pointed out in the written description, the claims, and the drawings.

The technical solution of the present invention is described in further detail below through the drawings and embodiments.

Brief description of the drawings

The drawings are provided for further understanding of the present invention and form part of the specification. Together with the embodiments of the invention they serve to explain the invention and do not limit it. In the drawings:

Fig. 1 is a flow chart of the method for processing sound signals to achieve a three-dimensional sound field auditory effect provided by an embodiment of the present invention;

Fig. 2 is a schematic diagram of sound source propagation provided by an embodiment of the present invention;

Fig. 3 is a block diagram of the device for processing sound signals to achieve a three-dimensional sound field auditory effect provided by an embodiment of the present invention.

Specific embodiment

Exemplary embodiments of the present disclosure are described in more detail below with reference to the accompanying drawings. Although the drawings show exemplary embodiments of the disclosure, it should be understood that the disclosure may be implemented in various forms and should not be limited by the embodiments set forth here. Rather, these embodiments are provided so that the disclosure will be understood more thoroughly and its scope fully conveyed to those skilled in the art.

An embodiment of the present invention provides a method for processing sound signals to achieve a three-dimensional sound field auditory effect, for use in a mobile terminal. Referring to Fig. 1, the method comprises:

S11, acquiring sound signals in multiple directions from a three-dimensional sound field;

S12, determining, according to the sound signals, the direction of a sound source relative to the mobile terminal;

S13, generating a spatial audio signal according to the sound signals and the direction of the sound source relative to the mobile terminal, the spatial audio signal being used to simulate the three-dimensional sound field;

S14, determining the number of micro-speakers in the mobile terminal;

S15, decoding each channel of a multichannel program with an audio decoder;

S16, routing each channel to the output channels according to the downmix scale factors and transformation matrix index provided by the encoder;

S17, sending the downmix to the output configuration with the corresponding number of channels, according to the determined number of micro-speakers.

In this embodiment, the invention can be applied to a mobile terminal on which microphones are arranged. The microphones measure the three-dimensional sound field, acquire sound signals from it, and pass the sound signals to the processor in the mobile terminal for enhancement; the sound signals before and after enhancement can be passed to the memory in the mobile terminal for storage.

Specifically, the microphones arranged on the mobile terminal may be omnidirectional microphones or microphones with a certain directivity; for example, MEMS (Micro-Electro-Mechanical System) microphones or ECM (Electret Condenser Microphone) microphones may be arranged on the mobile terminal.

In this embodiment, microphones can be arranged on the mobile terminal in many ways, and neither the number nor the positions of the microphones in the different arrangements are limited; this embodiment is described using the cases of 4 microphones and 3 microphones as examples. For example, 4 microphones may be placed at the four corners of the mobile terminal, or on its four sides. One microphone may also be placed at each of the following positions: the bottom edge of the mobile terminal, beside the front earpiece, near the rear camera, and near the bottom edge of the back. As another example, only 3 microphones may be placed on the mobile terminal.

In step S12, the direction of the sound source relative to the mobile terminal is obtained from the acquired sound signals. Referring to Fig. 2, the sound emitted by a sound source propagates directly to the left and right ears. Because the left ear is closer to the sound source, the sound reaches the left ear first and the right ear afterwards, and because of attenuation during propagation, the sound heard by the left ear is louder than that heard by the right ear. This is the direct sound signal, and the brain receives the signals delivered by both ears. At the same time, the sound emitted by the source is also reflected by surrounding objects; these reflections are delayed relative to the direct signal and quieter, and they form the indirect sound signals. The brain collects the direct signal and all the indirect signals, compares the signals captured by the left and right ears, and analyzes them to locate the sound source. Once this working mode of the brain is understood, the effect of a 3D sound source can be simulated by controlling the volume and delay of two loudspeakers or earphones, so that the brain constructs a virtual 3D sound scene.
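The volume-and-delay idea in the preceding paragraph can be illustrated with a minimal sketch. The function name `pan_with_delay` and its parameters are hypothetical and do not appear in the source; the delay is assumed to be a whole number of samples and the level difference a single gain factor, which is a strong simplification of real hearing.

```python
def pan_with_delay(mono, delay_samples, far_gain, source_on_left=True):
    """Build a (left, right) pair from a mono signal.

    The ear nearer the source gets the signal unchanged; the farther ear
    gets a copy that is delayed by `delay_samples` and scaled by `far_gain`.
    All names here are illustrative assumptions, not taken from the patent.
    """
    if delay_samples < 0:
        raise ValueError("delay_samples must be non-negative")
    if not 0.0 <= far_gain <= 1.0:
        raise ValueError("far_gain must lie in [0, 1]")
    # Pad both channels to the same length so they can be played together.
    near = list(mono) + [0.0] * delay_samples
    far = [0.0] * delay_samples + [far_gain * s for s in mono]
    return (near, far) if source_on_left else (far, near)


left, right = pan_with_delay([1.0, 0.5], delay_samples=2, far_gain=0.5)
```

With the source on the left, the right channel starts two samples later and at half the amplitude, which is the kind of cue the paragraph describes.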

The mobile terminal uses the microphones to estimate the sound source that emitted the sound and obtains its direction relative to the mobile terminal. In this embodiment the sound source can be estimated in many ways, for example with steered beamforming based on maximum output power, localization based on time difference of arrival, or localization based on high-resolution spectral estimation.

In addition, while estimating the sound source position, the mobile terminal can also enhance the received sound signals. For example, the mobile terminal can use techniques such as beamforming, spatial prediction, and auditory scene analysis to obtain enhanced sound signals in each direction. The specific methods by which the mobile terminal enhances the sound signals through beamforming may include: fixed beamforming techniques such as delay-and-sum beamforming and filter-and-sum; adaptive beamforming techniques such as adaptive beamforming based on the minimum variance distortionless response (MVDR) criterion, linearly constrained minimum variance (LCMV) beamforming, and sidelobe cancellation algorithms; or differential beamforming. The specific method by which the mobile terminal enhances the sound signals through spatial prediction may include: presetting certain directions in which spatial sound signals are expected to be captured, and then, using a pre-trained optimal filter bank and spatial prediction techniques, converting the sound signals received in all directions by the sound receivers on the mobile terminal into the output signals expected in the preset directions, such that the noise in the enhanced output signals is minimized and the prediction error approaches zero. In this embodiment, the auditory scene analysis technique may specifically be a blind source separation algorithm.
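Of the techniques listed above, delay-and-sum beamforming is the simplest to show. The following is a minimal sketch, assuming the per-microphone lags toward the target direction are already known as whole numbers of samples; the function name `delay_and_sum` and the lag convention are assumptions made for illustration, not details given in the source.

```python
def delay_and_sum(channels, lags):
    """Average microphone channels after undoing each channel's lag.

    channels: equal-length lists of samples, one list per microphone.
    lags: lags[k] is how many samples channel k trails the reference
          microphone for the target direction (assumed known here).
    """
    if len(channels) != len(lags):
        raise ValueError("one lag per channel is required")
    n = len(channels[0])
    out = [0.0] * n
    for samples, lag in zip(channels, lags):
        for i in range(n):
            j = i + lag  # read ahead to undo this channel's lag
            if 0 <= j < n:
                out[i] += samples[j]
    return [v / len(channels) for v in out]


mic0 = [0.0, 1.0, 0.0, 0.0, 0.0]
mic1 = [0.0, 0.0, 1.0, 0.0, 0.0]  # same pulse, one sample later
aligned = delay_and_sum([mic0, mic1], [0, 1])
unaligned = delay_and_sum([mic0, mic1], [0, 0])
```

When the lags match the true arrival pattern, the pulses add coherently; with the wrong lags the energy is smeared across samples, which is why steering the lags favors one direction.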

Enhancement yields sound signals with beam directivity in at least two directions. A sound signal with beam directivity in one direction is obtained by enhancing the sound signals received by the microphones; for example, the sound signals can be separated into the signals that sound sources in different directions, such as front, rear, left, and right, send to the mobile terminal. For example, after enhancing the received sound signals, the mobile terminal generates enhanced sound signals in four directions, with beam directivity toward the front left, front right, rear left, and rear right respectively; alternatively, the four enhanced signals may have beam directivity toward the front, rear, left, and right respectively. In this embodiment, sound signals with different directivities in multiple directions may also be combined, as needed, into a sound signal in a given direction, and the beam pattern of the enhanced sound signal with beam directivity may be cardioid or another shape such as supercardioid.

In step S13, the spatial audio signal is obtained according to the direction of the sound source relative to the mobile terminal and the acquired sound signals.

The spatial audio signal is used to simulate the three-dimensional sound field, which can be understood as the sound field within a certain range around the mobile terminal. A sound source can emit a sound signal from any direction in the three-dimensional sound field, and the signal is received by the mobile terminal.

For example, the mobile terminal uses the direction of the sound source relative to the mobile terminal and the sound signals received in all directions to generate a spatial audio signal that simulates the sound field for a 5.1-channel playback system. The mobile terminal can map the enhanced sound signals in the various directions to the sound signals in the six directions that together make up that spatial audio signal, and use the direction of the sound source relative to the mobile terminal to further improve the separation between those six signals. For example, the mobile terminal can calculate, from the direction of the sound source relative to the mobile terminal, a gain adjustment parameter for the sound signal in each direction required by the spatial audio signal, and use the gain adjustment parameters to adjust those sound signals. The spatial audio signal includes at least a left signal, a right signal, a center signal, a left rear surround signal, and a right rear surround signal.
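The source does not state how the gain adjustment parameters are computed from the source direction. Purely as an illustration, the sketch below assumes a raised-cosine weighting between the source azimuth and nominal loudspeaker azimuths; the azimuth table, the `floor` parameter, and the function name `direction_gains` are all assumptions and should not be read as the method of the invention.

```python
import math

# Nominal loudspeaker azimuths in degrees, positive to the left of front.
# These values are an assumption chosen for the example.
SPEAKER_AZIMUTH = {"L": 30.0, "R": -30.0, "C": 0.0, "Ls": 110.0, "Rs": -110.0}


def direction_gains(source_azimuth_deg, floor=0.25):
    """Return one illustrative gain per channel for a given source azimuth.

    A channel aligned with the source gets gain 1.0; a channel pointing the
    opposite way gets `floor`. The raised-cosine shape is hypothetical.
    """
    gains = {}
    for name, azimuth in SPEAKER_AZIMUTH.items():
        diff = math.radians(source_azimuth_deg - azimuth)
        weight = 0.5 * (1.0 + math.cos(diff))  # 1 when aligned, 0 when opposite
        gains[name] = floor + (1.0 - floor) * weight
    return gains


gains = direction_gains(30.0)  # source at the front left
```

For a source at the front left, the left channel keeps full level while channels farther from the source are attenuated, which increases the separation between channels in the sense described above.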

The sound signals that the mobile terminal receives in the various directions can correspond in many ways to the sound signals in the six directions required by the spatial audio signal that simulates the sound field for a 5.1-channel playback system. For example, the sound signals received by the mobile terminal are enhanced and output as sound signals in four directions: front left, rear left, front right, and rear right. The front-left sound signal is mapped to the left signal required by the spatial audio signal; the front-right sound signal is mapped to the right signal; the average of the front-left and front-right sound signals is mapped to the center signal; the rear-left sound signal is mapped to the left rear surround signal; the rear-right sound signal is mapped to the right rear surround signal; and the front-left, rear-left, front-right, and rear-right sound signals are averaged and the average is low-pass filtered at 150 Hz to obtain the subwoofer (low-frequency) signal required by the spatial audio signal.

As another example, the sound signals received by the mobile terminal are enhanced and output as sound signals in four directions: front, rear, left, and right. The average of the left and front sound signals is mapped to the left signal required by the spatial audio signal that simulates the sound field for a 5.1-channel playback system; the average of the right and front sound signals is mapped to the right signal; the front sound signal is mapped to the center signal; the average of the left and rear sound signals is mapped to the left rear surround signal; the average of the right and rear sound signals is mapped to the right rear surround signal; and the front, rear, left, and right sound signals are averaged and the average is low-pass filtered at 150 Hz to obtain the subwoofer (low-frequency) signal required by the spatial audio signal.
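The first of the two mappings above (front left, rear left, front right, rear right) can be sketched as follows. The source specifies only a 150 Hz low-pass; the first-order filter, the 48 kHz default sample rate, and the names `low_pass` and `map_to_5_1` are assumptions made for this example.

```python
import math


def low_pass(samples, cutoff_hz, sample_rate_hz):
    """First-order RC-style low-pass filter (filter order is an assumption)."""
    rc = 1.0 / (2.0 * math.pi * cutoff_hz)
    dt = 1.0 / sample_rate_hz
    alpha = dt / (rc + dt)
    out, prev = [], 0.0
    for x in samples:
        prev = prev + alpha * (x - prev)  # move a fraction alpha toward x
        out.append(prev)
    return out


def map_to_5_1(front_left, rear_left, front_right, rear_right,
               sample_rate_hz=48000):
    """Map four directional signals to the six signals of a 5.1 layout."""
    center = [(a + b) / 2.0 for a, b in zip(front_left, front_right)]
    mean4 = [(a + b + c + d) / 4.0
             for a, b, c, d in zip(front_left, rear_left,
                                   front_right, rear_right)]
    return {
        "L": list(front_left),
        "R": list(front_right),
        "C": center,
        "Ls": list(rear_left),
        "Rs": list(rear_right),
        "LFE": low_pass(mean4, 150.0, sample_rate_hz),
    }


out = map_to_5_1([1.0, 1.0], [0.0, 0.0], [3.0, 3.0], [0.0, 0.0])
```

The second mapping (front, rear, left, right) would differ only in which pairs are averaged for the left, right, and surround signals.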

The method for processing sound signals provided by the embodiment of the present invention can acquire sound signals from the three-dimensional sound field of the mobile terminal, obtain the direction of each sound source relative to the mobile terminal, and then use those directions and the sound signals to obtain a spatial audio signal for simulating the three-dimensional sound field.

The solution provided by the invention can use the components of the mobile terminal itself to capture and process the sound signals used to simulate the three-dimensional sound field, analyze the direction, relative to the mobile terminal, of the sound sources received in each direction, and use those directions to enhance the effect of the three-dimensional sound field. Compared with prior-art solutions that simulate the three-dimensional sound field using beams alone, the present invention obtains the direction of the sound source in each direction relative to the mobile terminal and uses this direction information to apply a further gain adjustment to the beamforming. This alleviates the phenomenon in which left-right discrimination in the simulated three-dimensional sound field is more pronounced than front-back discrimination, and thereby improves the quality of the simulated three-dimensional sound field.

In this embodiment, the mobile terminal can obtain the direction of the sound source relative to the mobile terminal in many ways. For example, positioning technology can be used to obtain the spatial coordinates of the mobile terminal acting as the sound source and of the receiving mobile terminal, and the direction of the sound source relative to the mobile terminal can be determined from these coordinates. However, positioning consumes network bandwidth and introduces some latency, and in this embodiment the mobile terminal needs to obtain the directions of sound sources in multiple directions relative to itself. The direction of the sound source relative to the mobile terminal can therefore be obtained with a localization technique based on time difference of arrival.

To simulate the interaction between real sound waves and the ear, Resonance Audio technology uses head-related transfer functions (HRTFs) as its audio localization algorithm. HRTFs capture the time differences and intensity differences that determine the position of a sound, as well as the spectral effects that determine it. The algorithm computes the various effects of the head and ears on a sound wave after it has been emitted and reflected, simulating how the human nervous system judges the position of a sound source, in particular its vertical height (elevation). Audio processed with HRTFs and played through earphones creates an illusion in the user's brain that the sound has a specific position in the virtual world around them, which achieves the goal of simulating the localization of the sound source. In addition, Resonance Audio can simulate not only the interaction of sound waves with the ear but also the interaction of sound waves with their surroundings.

Sound source localization methods based on microphone arrays fall broadly into three classes: steered beamforming based on maximum output power, techniques based on high-resolution spectral estimation, and sound source localization based on time-delay estimation (TDE).

The core of TDE-based algorithms is accurate estimation of the propagation delay, which is generally obtained by cross-correlating the signals from different microphones. The sound source position can then be obtained by simple delay-and-sum, by geometric calculation, or by searching the steered response power directly from the cross-correlation results. Algorithms of this kind are relatively simple to implement and computationally light, which makes them suitable for real-time processing, so they are the most widely used in practice.
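A minimal sketch of the cross-correlation step and the geometric calculation follows. It assumes a single pair of microphones, a far-field source, and whole-sample lags; the names `estimate_delay` and `arrival_angle`, the 343 m/s speed of sound, and the example spacing are assumptions for illustration only.

```python
import math


def estimate_delay(ref, other, max_lag):
    """Return the lag of `other` relative to `ref`, in samples.

    The lag that maximizes the cross-correlation is chosen; a positive
    value means `other` received the sound later than `ref`.
    """
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i, r in enumerate(ref):
            j = i + lag
            if 0 <= j < len(other):
                score += r * other[j]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def arrival_angle(lag_samples, sample_rate_hz, mic_spacing_m,
                  speed_of_sound=343.0):
    """Far-field angle from broadside, in degrees, for a two-microphone pair."""
    x = speed_of_sound * lag_samples / sample_rate_hz / mic_spacing_m
    x = max(-1.0, min(1.0, x))  # guard against lags beyond the physical limit
    return math.degrees(math.asin(x))


ref = [0.0, 0.0, 1.0, 2.0, 1.0, 0.0, 0.0, 0.0]
other = [0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 1.0, 0.0]  # same shape, 2 samples later
lag = estimate_delay(ref, other, max_lag=3)
angle = arrival_angle(lag, sample_rate_hz=48000, mic_spacing_m=0.1)
```

With more than two microphones, the same pairwise delays can be combined with the known microphone positions to resolve the direction in three dimensions, which is the role of the geometric calculation mentioned above.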

In addition, the audio decoder can perform a downmix, which is needed when the number of output channels is smaller than the number of coded channels. In this way the complete multichannel program can be played back over fewer channels. The decoder decodes each channel and routes the channels to the output channels using the downmix scale factors and transformation matrix index provided by the encoder (the scale factors set the relative level of each channel; the corresponding inverse transformation matrix is obtained from the transformation matrix index and is used to restore the corresponding quantized channel units). The downmix can be sent to output configurations of 3 to 8 channels, which are suitable for mobile terminals.
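The routing step can be pictured as a matrix of scale factors applied to the decoded channels. The sketch below is an assumption-laden illustration: the source gives no coefficient values, so the 5-to-3 table, the input channel order, and the function name `downmix` are hypothetical, and the inverse-transform step for quantized channel units is not modeled.

```python
def downmix(inputs, matrix):
    """Route decoded channels to output channels with scale factors.

    inputs: list of decoded channels, each a list of samples.
    matrix: matrix[o][c] is the scale factor applied to input channel c
            when it is routed to output channel o.
    """
    n = len(inputs[0])
    outputs = []
    for row in matrix:
        if len(row) != len(inputs):
            raise ValueError("each row needs one scale factor per input")
        outputs.append([sum(g * ch[i] for g, ch in zip(row, inputs))
                        for i in range(n)])
    return outputs


# Hypothetical 5-to-3 table; input order is L, R, C, Ls, Rs.
FIVE_TO_THREE = [
    [1.0, 0.0, 0.0, 0.7071, 0.0],  # left output: L plus attenuated Ls
    [0.0, 1.0, 0.0, 0.0, 0.7071],  # right output: R plus attenuated Rs
    [0.0, 0.0, 1.0, 0.0, 0.0],     # center output: C unchanged
]

decoded = [[1.0], [2.0], [3.0], [1.0], [1.0]]
three_channel = downmix(decoded, FIVE_TO_THREE)
```

A terminal with a different number of micro-speakers would select a table with a matching number of rows, which corresponds to choosing the output configuration by speaker count.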

The number of micro-speakers can be determined according to the size of the mobile terminal. The audio decoder decodes each channel of the multichannel program and routes the channels to the output channels using the downmix scale factors and transformation matrix index provided by the encoder. The downmix is sent to the output configuration with the corresponding number of channels, according to the determined number of micro-speakers.

In a second aspect, the present invention also provides a device for processing sound signals to achieve a three-dimensional sound field auditory effect. Referring to Fig. 3, the device comprises:

An acquisition module 31, for acquiring sound signals in multiple directions from a three-dimensional sound field;

A first determining module 32, for determining, according to the sound signals, the direction of a sound source relative to the mobile terminal;

A generation module 33, for generating a spatial audio signal according to the sound signals and the direction of the sound source relative to the mobile terminal, the spatial audio signal being used to simulate the three-dimensional sound field;

A second determining module 34, for determining the number of micro-speakers in the mobile terminal;

A decoder module 35, for decoding each channel of a multichannel program with an audio decoder;

A routing module 36, for routing each channel to the output channels according to the downmix scale factors and transformation matrix index provided by the encoder;

A sending module 37, for sending the downmix to the output configuration with the corresponding number of channels, according to the determined number of micro-speakers.

In one embodiment, the first determining module 32 comprises:

An acquisition submodule 321, for obtaining the time difference of arrival between the sound signal that the microphones on the mobile terminal receive in one direction and the sound signal they receive in another direction, the microphones on the mobile terminal being used to receive sound signals in at least four directions;

A determining submodule 322, for determining the direction of the sound source relative to the mobile terminal according to the obtained time difference of arrival and the positions of the microphones on the mobile terminal.

In one embodiment, the acquisition submodule 321 is specifically used for determining the time difference and intensity difference for the sound position according to an audio localization algorithm.

Finally, it should be noted that the above embodiments are only intended to illustrate the technical solution of the present invention, not to limit it. Although the invention has been described in detail with reference to the embodiments, those of ordinary skill in the art should understand that modifications or equivalent substitutions of the technical solution of the invention that do not depart from its spirit and scope shall all fall within the scope of the claims of the present invention.

Claims

1. A method for processing sound signals to achieve a three-dimensional sound field auditory effect, for use in a mobile terminal, characterized by comprising:

Generating a spatial audio signal according to the sound signals and the direction of the sound source relative to the mobile terminal, the spatial audio signal being used to simulate the three-dimensional sound field;

Determining the number of micro-speakers in the mobile terminal;

Routing each channel to the output channels according to the downmix scale factors and transformation matrix index provided by the encoder;

2. The method according to claim 1, characterized in that determining the direction of the sound source relative to the mobile terminal according to the sound signals comprises:

Obtaining the time difference of arrival between the sound signal that the microphones on the mobile terminal receive in one direction and the sound signal they receive in another direction, the microphones on the mobile terminal being used to receive sound signals in at least four directions;

3. The method according to claim 2, characterized in that obtaining the time difference of arrival between the sound signal that the microphones on the mobile terminal receive in one direction and the sound signal they receive in another direction comprises:

4. A device for processing sound signals to achieve a three-dimensional sound field auditory effect, characterized by comprising:

A generation module, for generating a spatial audio signal according to the sound signals and the direction of the sound source relative to the mobile terminal, the spatial audio signal being used to simulate the three-dimensional sound field;

A routing module, for routing each channel to the output channels according to the downmix scale factors and transformation matrix index provided by the encoder;

5. The device according to claim 4, characterized in that the first determining module comprises:

An acquisition submodule, for obtaining the time difference of arrival between the sound signal that the microphones on the mobile terminal receive in one direction and the sound signal they receive in another direction, the microphones on the mobile terminal being used to receive sound signals in at least four directions;

A determining submodule, for determining the direction of the sound source relative to the mobile terminal according to the obtained time difference of arrival and the positions of the microphones on the mobile terminal.

6. The device according to claim 5, characterized in that the acquisition submodule is specifically used for determining the time difference and intensity difference for the sound position according to an audio localization algorithm.