WO2022151657A1 - Noise cancellation method and apparatus, and audio device and computer-readable storage medium - Google Patents
Noise cancellation method and apparatus, and audio device and computer-readable storage medium Download PDFInfo
- Publication number
- WO2022151657A1 WO2022151657A1 PCT/CN2021/101463 CN2021101463W WO2022151657A1 WO 2022151657 A1 WO2022151657 A1 WO 2022151657A1 CN 2021101463 W CN2021101463 W CN 2021101463W WO 2022151657 A1 WO2022151657 A1 WO 2022151657A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- user
- noise reduction
- call
- ambient noise
- energy
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 60
- 230000008569 process Effects 0.000 claims abstract description 23
- 230000009467 reduction Effects 0.000 claims description 177
- 230000007613 environmental effect Effects 0.000 claims description 49
- 238000004590 computer program Methods 0.000 claims description 27
- 230000004913 activation Effects 0.000 claims description 4
- 230000003044 adaptive effect Effects 0.000 abstract description 8
- 230000006870 function Effects 0.000 description 29
- 230000000875 corresponding effect Effects 0.000 description 16
- 230000009286 beneficial effect Effects 0.000 description 15
- 238000004891 communication Methods 0.000 description 7
- 238000010586 diagram Methods 0.000 description 4
- 238000001514 detection method Methods 0.000 description 2
- 230000002452 interceptive effect Effects 0.000 description 2
- 238000011946 reduction process Methods 0.000 description 2
- 230000002829 reductive effect Effects 0.000 description 2
- 230000005236 sound signal Effects 0.000 description 2
- 230000001276 controlling effect Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R1/00—Details of transducers, loudspeakers or microphones
- H04R1/10—Earpieces; Attachments therefor ; Earphones; Monophonic headphones
- H04R1/1083—Reduction of ambient noise
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R3/00—Circuits for transducers, loudspeakers or microphones
Definitions
- the present disclosure relates to the technical field of audio processing, and in particular, to a noise reduction method, an apparatus, an audio device, and a computer-readable storage medium.
- ANC Active Noise Cancellation
- the headset While pursuing an immersive experience, if the user uses the headset to talk, and the headset does not automatically remove the active noise reduction function, the user will not have background noise as a reference during the call, and the volume of the sound cannot be well controlled. It will affect the surrounding people due to the loud sound, and it will also make the other party feel awkward and cause discomfort to the other party. In addition, due to the existence of environmental noise, it may interfere with the voice of the call, so that the other party of the call cannot receive high-quality voice signals, thereby reducing the quality of the call.
- An embodiment of the present disclosure provides a noise reduction method, which is applied to an earphone, and the method includes:
- the earphone is controlled to enable the ambient noise reduction mode.
- the process of detecting that the user is talking includes:
- volume of the target user during the call is higher than a preset volume threshold, it is determined that the user is on a call.
- the process of determining that the user is a target user includes:
- the matching degree is higher than a preset matching degree threshold, it is determined that the user is the target user.
- the process of determining, according to the sound energy and ambient noise energy, that the conditions for enabling ambient noise reduction are met includes:
- the ratio of the user's sound energy to the environmental noise energy is greater than a preset signal-to-noise ratio threshold, it is determined that the activation environment noise reduction condition is satisfied.
- An embodiment of the present disclosure provides a noise reduction device, which is applied to an earphone, and the device includes:
- a first control module configured to control the headset to exit the active noise reduction mode when it is detected that the user is talking when the headset is in the active noise reduction mode
- an acquisition module for acquiring the user's voice energy and ambient noise energy during the call
- the second control module is configured to control the earphone to activate the ambient noise reduction mode when it is determined according to the sound energy and the ambient noise energy that the conditions for enabling ambient noise reduction are satisfied.
- An embodiment of the present disclosure provides an audio device, including a memory and a processor, where the memory stores a computer program, and the processor implements the steps of the noise reduction method provided by any embodiment of the present disclosure when the processor executes the computer program.
- An embodiment of the present disclosure provides a computer-readable storage medium on which a computer program is stored, and when the computer program is executed by a processor, implements the steps of the noise reduction method provided by any embodiment of the present disclosure.
- the noise reduction method, device, audio device, and computer-readable storage medium control the headset to exit the active noise reduction mode when it is detected that the user is using the headset to talk, that is, the active noise reduction function is automatically turned off during the call, Therefore, the background noise is provided for the user as a reference, which is beneficial for the user to better control the volume, and avoid discomfort caused by the surrounding people or the other party of the call due to the loud sound.
- the headset is controlled to enable the ambient noise reduction function, that is, adaptive ambient noise reduction is realized, thereby avoiding the impact of ambient noise on the call sound. Interference, which is beneficial to improve the high-quality voice signal to the other party of the call, thereby improving the quality of the call.
- FIG. 1 is a schematic flowchart of a noise reduction method in one embodiment
- FIG. 2 is a schematic flowchart of a process of detecting that a user is talking in an embodiment
- FIG. 3 is a schematic flowchart of a process of determining a user as a target user in one embodiment
- FIG. 4 is a structural block diagram of a noise reduction device in one embodiment
- FIG. 5 is an internal structure diagram of an audio device in one embodiment.
- a noise reduction method is provided.
- the noise reduction method is mainly applied to an audio device, such as an earphone, for illustration.
- the earphone can be a wired earphone or a wireless communication earphone, such as a bluetooth earphone.
- the noise reduction method includes the following steps:
- Step S1 When the headset is in the active noise reduction mode, if it is detected that the user is talking, the headset is controlled to exit the active noise reduction mode.
- the earphone is in the active noise reduction mode, that is, the active noise reduction function of the earphone is turned on.
- the earphone will simulate a sound wave of the same frequency as the noise, and the anti-phase sound wave can partially or completely cancel the noise, so that the noise can be partially or completely canceled.
- the headset is controlled to exit the active noise reduction mode, that is, the active noise reduction function is turned off, the background noise is no longer reduced, and a natural noise state is presented. It enables users to clearly perceive the volume of their own voice, provides a reference for the comparison of the user's speaking volume, and then effectively controls the speaking volume, avoiding the discomfort caused to the surrounding people and the other party due to the excessively loud voice of the call, or because the voice of the call is too low As a result, the other party of the call cannot hear clearly, which in turn affects the call process and the call experience, thereby ensuring the balance of the user's call sound. If it is detected that the user is not talking, the corresponding action is not performed, that is, the headset does not exit the active noise reduction mode, and the active noise reduction function continues to be turned on.
- Step S2 Acquire the sound energy of the user and the ambient noise energy of the environment where the user is located during the call.
- the user's voice during the call and the environmental noise of the environment where the user is located can be collected through an audio collection and processing device such as a microphone, and the corresponding volume levels can be obtained through analysis.
- the sound wave energy corresponding to the user's voice that is, the user's sound energy
- the sound wave energy corresponding to the ambient noise in the environment where the user is located that is, the ambient noise energy
- Step S3 If it is determined according to the sound energy and the environmental noise energy that the conditions for enabling environmental noise reduction are met, control the earphone to start the environmental noise reduction mode.
- the headset activates the Environmental Noise Cancellation (ENC) mode, that is, the headset can accurately calculate the direction of the user's speech through the dual-microphone array, and remove various interfering noises in the environment while protecting the target voice in the main direction.
- EEC Environmental Noise Cancellation
- the headset when the user talks through the headset, it is judged whether the conditions for enabling environmental noise reduction are satisfied according to the sound energy and ambient noise energy obtained in the above step S2, and if the conditions for enabling environmental noise reduction are satisfied, the headset is controlled Activate the environmental noise reduction mode to suppress or eliminate various noises in the environment, realize adaptive environmental noise reduction, improve the anti-interference of environmental noise during the call, and ensure that high-quality voice signals are provided for the caller, thereby improving call quality. . If the conditions for enabling ambient noise reduction are not met, the corresponding action is not performed, that is, the ambient noise reduction mode is not activated.
- the headset when it is detected that the user is using the headset to talk, the headset is controlled to exit the active noise reduction mode, that is, the active noise reduction function is automatically turned off during the call, so as to provide the user with background noise as a reference, which is conducive to better control by the user.
- the volume is too loud to avoid discomfort to the people around you or the other party in the call.
- the headset when it is determined that the conditions for enabling ambient noise reduction are met, the headset is controlled to enable the ambient noise reduction function, that is, adaptive ambient noise reduction is realized, thereby avoiding the impact of ambient noise on the call sound. Interference, which is beneficial to improve the high-quality voice signal to the other party of the call, thereby improving the quality of the call.
- step S1 the process of detecting that the user is on a call includes the following steps:
- Step S11 Determine the user as the target user.
- the target user is, for example, the user to which the headset belongs, that is, it is determined that the user currently talking is the user to which the headset belongs, so as to avoid misoperation caused by false detection and improve reliability. For example, when it detects other people who are talking, control the headset to exit the active noise reduction mode, resulting in misoperation.
- the headset is in active noise reduction mode. If someone is talking around the user, the headset may recognize the voice of the call and mistakenly think it is due to the small distance between them. When the user is on a call, he may exit the active noise reduction mode, causing misoperation. In fact, the user is listening to music, and exiting the active noise reduction mode at this time will greatly affect the user's experience and reduce the user's experience. Therefore, in the embodiment of the present disclosure, it is preferred to determine the user who is talking as the target user, that is, the user to which the headset belongs, so as to prevent misidentification and misoperation from affecting the user experience and experience, and improve reliability.
- step S11 it is determined that the person who is talking is the user himself, not other people.
- Step S12 If the volume of the target user during the call is higher than the preset volume threshold, it is determined that the user is on a call.
- the headset After it is determined that the user who is on the call is the target user, it is also necessary to determine whether the volume of the target user's voice during the call satisfies a certain condition, that is, to determine whether the volume of the target user during the call is higher than the preset volume threshold. If so, it is determined that the target user is talking, thereby further avoiding misidentification and misoperation, and improving reliability. If not, that is, when the target user's volume during the call is less than or equal to the preset volume threshold, it is determined that the target user is not talking, no operation is performed, and the headset continues to maintain the active noise reduction mode.
- a certain condition that is, to determine whether the volume of the target user during the call is higher than the preset volume threshold. If so, it is determined that the target user is talking, thereby further avoiding misidentification and misoperation, and improving reliability. If not, that is, when the target user's volume during the call is less than or equal to the preset volume threshold, it is determined that the target user is not talking
- the user's speaking voice will reach a certain level, such as reaching a preset volume threshold, which is beneficial for the other party to receive the sound signal clearly.
- a certain level such as reaching a preset volume threshold
- the user's voice is probably not talking. For example, the user is talking with the surrounding people in a low voice, or the user is currently listening to music through headphones and sings along with the music softly. If it is mistakenly recognized as a call sound, it will cause a misoperation, that is, the headset will exit the active noise reduction mode, which will affect the user's experience and experience.
- the headset after determining the target user, it is necessary to further determine whether the user's volume is higher than the preset volume threshold, and after determining that the user's volume is higher than the preset volume threshold, determine that the user is talking , at this time, the headset exits the active noise reduction mode, thereby further avoiding misidentification and misoperation, avoiding affecting the user's experience and experience, and improving reliability.
- the preset volume threshold is, for example, a preset volume threshold, which is based on, for example, determining as accurately as possible whether the target user is on a call.
- the preset volume threshold is, for example, 40 decibels, that is, when it is recognized that the volume of the target user during the call is greater than 40 decibels, it is determined that the user is talking. Otherwise, it is considered that the user is not talking, and the headset continues to maintain the active noise reduction mode.
- step S11 the process of determining the user as the target user includes the following steps:
- Step S111 Identify the voiceprint feature of the user during the call.
- Step S112 Match the voiceprint feature with a preset reference voiceprint corresponding to the target user.
- the target user that is, the reference voiceprint of the user to which the headset belongs is pre-stored in the headset, and then the user's voiceprint feature actually collected during the call is matched with the pre-stored reference voiceprint feature of the target user, It is beneficial to accurately determine whether the user is the target user, that is, whether the user belongs to the headset.
- Step S113 If the matching degree is higher than the preset matching degree threshold, determine the user as the target user.
- the user is determined to be the target user, that is, the user to which the headset belongs. , so that the user identity can be accurately identified, the target user can be determined, and misidentification can be prevented.
- the preset matching degree threshold is, for example, a preset value, and its value is based on, for example, determining as accurately as possible whether the user who is talking is the target user. It is understandable that the preset matching degree threshold value should not be too high or too low. If the value is too high, the target user may not be recognized due to environmental disturbances, and there will be an error of abandonment; if the value is too low, it may The voices of other users who are not the target user are recognized as the target user, resulting in a false acceptance error.
- the preset matching degree threshold is, for example, 95%.
- the process of detecting whether the user is on a call includes, for example: first, identifying the calling user as a target user, not other users.
- the voiceprint feature of the target user that is, the reference voiceprint
- the headset pre-recorded in the headset
- the user's voiceprint feature actually collected during the call is compared with the reference voiceprint. If the comparison result reaches a certain confidence level, If it is 95%, it is determined that the user who is talking is the target user, that is, the user to which the headset belongs, thereby effectively avoiding misrecognition and improving the recognition accuracy of the target user.
- the headset exits the active noise reduction mode. Otherwise, it is considered that the user is not talking, and the headset continues to maintain the active noise reduction mode.
- a preset volume threshold such as 40 decibels
- step S3 determining a process that satisfies the conditions for enabling environmental noise reduction according to the sound energy and the ambient noise energy includes: if the ratio of the user's sound energy to the ambient noise energy is greater than a preset signal-to-noise ratio If the threshold is set, it is determined that the conditions for enabling ambient noise reduction are met, and the headset starts the ambient noise reduction mode. Otherwise, that is, the ratio of the user's sound energy to the ambient noise energy is less than or equal to the preset signal-to-noise ratio threshold, it is determined that the conditions for enabling ambient noise reduction are not met, and the headset does not enable the ambient noise reduction mode.
- the preset signal-to-noise ratio threshold is a preset value, which is based on, for example, determining as accurately as possible whether the earphone satisfies the ambient noise reduction condition according to the relationship between sound energy and ambient noise energy.
- the preset signal-to-noise ratio threshold is, for example, a value greater than 1, such as 1.5.
- a signal-to-noise ratio threshold can be set according to the sound energy and environmental noise energy, expressed as: sound energy/environmental noise energy, that is, the ratio of sound energy to environmental noise energy .
- sound energy/environmental noise energy that is, the ratio of sound energy to environmental noise energy .
- the environmental noise reduction process is self-destructive to the useful sound wave signal, so when the user's sound energy is less than or equal to the environmental noise energy, the environmental noise reduction mode is generally not turned on, otherwise the user's voice will be eliminated.
- the preset signal-to-noise ratio threshold is generally set to a value greater than 1, such as 1.5.
- the ambient noise reduction mode is automatically activated to reduce ambient noise. Therefore, it can provide clear and high-quality voice signals for the other party and improve the quality of the call.
- the active noise reduction ANC function when it is detected that the user is speaking by himself, the active noise reduction ANC function is automatically turned off to present a natural background noise state, so that the user can effectively control the volume; further It can automatically determine whether to activate the environmental noise reduction ENC function according to the current environmental noise, so as to provide a relatively high-quality voice signal for the caller and improve the call quality.
- steps in the flowcharts of FIGS. 1-3 are shown in sequence according to the arrows, these steps are not necessarily executed in the sequence shown by the arrows. Unless explicitly stated herein, the execution of these steps is not strictly limited to the order, and these steps may be performed in other orders. Moreover, at least a part of the steps in FIGS. 1-3 may include multiple sub-steps or multiple stages. These sub-steps or stages are not necessarily executed and completed at the same time, but may be executed at different times. These sub-steps or stages are not necessarily completed at the same time. The order of execution of the steps is not necessarily sequential, but may be performed alternately or alternately with other steps or at least a part of sub-steps or stages of other steps.
- a noise reduction device is provided.
- the noise reduction device is mainly applied to an audio device, such as an earphone, for illustration.
- the earphone may be a wired earphone, or a earphone using wireless communication, such as a Bluetooth earphone.
- the noise reduction device 100 includes: a first control module 110 , an acquisition module 120 and a second control module 130 .
- the first control module 110 is configured to control the headset to exit the active noise reduction mode when it is detected that the user is talking when the headset is in the active noise reduction mode.
- the earphone is in the active noise reduction mode, that is, the active noise reduction function of the earphone is turned on.
- the earphone will simulate a sound wave of the same frequency as the noise, and the anti-phase sound wave can partially or completely cancel the noise, so that the noise can be partially or completely canceled.
- the headset is controlled to exit the active noise reduction mode, that is, the active noise reduction function is turned off, the background noise is no longer reduced, and a natural noise state is presented. It enables users to clearly perceive the volume of their own voice, provides a reference for the comparison of the user's speaking volume, and then effectively controls the speaking volume, avoiding the discomfort caused to the surrounding people and the other party due to the excessively loud voice of the call, or because the voice of the call is too low As a result, the other party of the call cannot hear clearly, which in turn affects the call process and the call experience, thereby ensuring the balance of the user's call sound. If it is detected that the user is not talking, the corresponding action is not performed, that is, the headset does not exit the active noise reduction mode, and the active noise reduction function continues to be turned on.
- the obtaining module 120 is configured to obtain the sound energy of the user and the ambient noise energy of the environment where the user is located during the call.
- the acquisition module 120 includes, for example, an audio collection and processing device such as a microphone, which can collect the user's voice during the call and the ambient noise of the user's environment through the microphone and other audio collection and processing devices, and analyze the corresponding volume levels.
- an audio collection and processing device such as a microphone
- the sound wave energy corresponding to the user's voice that is, the user's sound energy
- the sound wave energy corresponding to the ambient noise in the environment where the user is located that is, the ambient noise energy
- the second control module 130 is configured to control the earphone to activate the ambient noise reduction mode when it is determined according to the sound energy and the ambient noise energy that the conditions for enabling ambient noise reduction are satisfied.
- the headset activates the environmental noise reduction mode, that is, the headset can accurately calculate the direction of the user's speech through the dual-microphone array, and remove various interfering noises in the environment while protecting the target voice in the main direction.
- the second control module 130 determines whether the startup environment noise reduction condition is satisfied. Noise conditions, control the headset to start the environmental noise reduction mode, so as to suppress or eliminate various noises in the environment, realize adaptive environmental noise reduction, improve the anti-interference of environmental noise during the call, and ensure high-quality voice for the other party. signal to improve call quality. If the conditions for enabling ambient noise reduction are not met, the corresponding action is not performed, that is, the ambient noise reduction mode is not activated.
- the headset when it is detected that the user is using the headset to talk, the headset is controlled to exit the active noise reduction mode, that is, the active noise reduction function is automatically turned off during the call, so as to provide the user with background noise as a reference, which is beneficial for the user to better. Control the volume to avoid discomfort to the people around you or the caller due to the loud sound.
- the headset when it is determined that the conditions for enabling ambient noise reduction are met, the headset is controlled to enable the ambient noise reduction function, that is, adaptive ambient noise reduction is realized, thereby avoiding the impact of ambient noise on the call sound. Interference, which is beneficial to improve the high-quality voice signal to the other party of the call, thereby improving the quality of the call.
- the first control module 110 detects a process in which the user is talking, specifically including: determining that the user is the target user, and when the target user's volume during the call is higher than a preset volume threshold, determining A user is detected on a call.
- the target user is, for example, the user to which the headset belongs, that is, it is determined that the user currently talking is the user to which the headset belongs, so as to avoid misoperation caused by false detection and improve reliability. For example, when it detects other people who are talking, control the headset to exit the active noise reduction mode, causing misoperation.
- the headset is in active noise reduction mode. If someone is talking around the user, the headset may recognize the call sound and mistakenly think it is due to the small distance between them. When the user is on a call, he may exit the active noise reduction mode, causing misoperation. In fact, the user is listening to music, and exiting the active noise reduction mode at this time will greatly affect the user's experience and reduce the user's experience. Therefore, in the embodiment of the present disclosure, it is preferred to determine the user who is talking as the target user, that is, the user to which the headset belongs, so as to prevent misidentification and misoperation from affecting the user experience and experience, and improve reliability.
- the headset continues to maintain the active noise reduction mode.
- the user's speaking voice will reach a certain level, such as reaching a preset volume threshold, which is beneficial for the other party to receive the sound signal clearly.
- a certain level such as reaching a preset volume threshold
- the user's voice is probably not talking. For example, the user is talking with the surrounding people in a low voice, or the user is currently listening to music through headphones and sings along with the music softly. If it is mistakenly recognized as a call sound, it will cause a misoperation, that is, the headset will exit the active noise reduction mode, which will affect the user's experience and experience.
- the headset after determining the target user, it is necessary to further determine whether the user's volume is higher than the preset volume threshold, and after determining that the user's volume is higher than the preset volume threshold, determine that the user is talking , at this time, the headset exits the active noise reduction mode, thereby further avoiding misidentification and misoperation, avoiding affecting the user's experience and experience, and improving reliability.
- the preset volume threshold is, for example, a preset volume threshold, which is based on, for example, determining as accurately as possible whether the target user is on a call.
- the preset volume threshold is, for example, 40 decibels, that is, when it is recognized that the volume of the target user during the call is greater than 40 decibels, it is determined that the user is talking. Otherwise, it is considered that the user is not talking, and the headset continues to maintain the active noise reduction mode.
- the process of determining the user as the target user by the first control module 110 specifically includes: identifying the voiceprint feature of the user during the call; comparing the voiceprint feature with a preset benchmark corresponding to the target user The voiceprint is matched; when the matching degree is higher than the preset matching degree threshold, the user is determined as the target user.
- the target user that is, the reference voiceprint of the user to which the headset belongs is pre-stored in the headset, and then the user's voiceprint feature actually collected during the call is matched with the pre-stored target user's reference voiceprint feature, It is beneficial to accurately determine whether the user is the target user, that is, whether the user belongs to the headset. If the matching degree between the voiceprint feature of the user actually collected during the call and the reference voiceprint feature of the pre-stored target user is higher than the preset matching degree threshold, the user is determined as the target user, that is, the user to which the headset belongs, so that accurate Identify user identities, determine target users, and prevent misidentification.
- the preset matching degree threshold is, for example, a preset value, and its value is based on, for example, determining as accurately as possible whether the user who is talking is the target user. It is understandable that the preset matching degree threshold value should not be too high or too low. If the value is too high, the target user may not be recognized due to environmental disturbances, and there will be an error of abandonment; if the value is too low, it may The voices of other users who are not the target user are recognized as the target user, resulting in a false acceptance error.
- the preset matching degree threshold is, for example, 95%.
- the process of the first control module 110 detecting whether the user is on a call includes, for example: first, identifying the calling user as the target user, not other users.
- the voiceprint feature of the target user that is, the reference voiceprint
- the headset pre-recorded in the headset
- the user's voiceprint feature actually collected during the call is compared with the reference voiceprint. If the comparison result reaches a certain level of confidence, If it is 95%, it is determined that the user who is talking is the target user, that is, the user to which the headset belongs, thereby effectively avoiding misrecognition and improving the recognition accuracy of the target user.
- the headset exits the active noise reduction mode. Otherwise, it is considered that the user is not talking, and the headset continues to maintain the active noise reduction mode.
- a preset volume threshold such as 40 decibels
- the second control module 130 determines the process of satisfying the conditions for enabling environmental noise reduction according to the sound energy and the ambient noise energy, and specifically includes: when the ratio of the user's sound energy to the ambient noise energy is greater than a preset signal When the noise ratio threshold is set, it is determined that the conditions for enabling ambient noise reduction are met, and the headset starts the ambient noise reduction mode. Otherwise, that is, the ratio of the user's sound energy to the ambient noise energy is less than or equal to the preset signal-to-noise ratio threshold, it is determined that the conditions for enabling ambient noise reduction are not met, and the headset does not enable the ambient noise reduction mode.
- the preset signal-to-noise ratio threshold is a preset value, which is based on, for example, determining as accurately as possible whether the earphone satisfies the ambient noise reduction condition according to the relationship between sound energy and ambient noise energy.
- the preset signal-to-noise ratio threshold is, for example, a value greater than 1, such as 1.5.
- a signal-to-noise ratio threshold can be set according to the sound energy and environmental noise energy, expressed as: sound energy/environmental noise energy, that is, the ratio of sound energy to environmental noise energy .
- sound energy/environmental noise energy that is, the ratio of sound energy to environmental noise energy .
- the environmental noise reduction process is self-destructive to the useful sound wave signal, so when the user's sound energy is less than or equal to the environmental noise energy, the environmental noise reduction mode is generally not turned on, otherwise the user's voice will be eliminated.
- the preset SNR threshold is generally set to a value greater than 1, such as 1.5.
- the ambient noise reduction mode is automatically activated to reduce ambient noise. Therefore, it can provide clear and high-quality voice signals for the other party and improve the quality of the call.
- the noise reduction device 100 of the embodiment of the present disclosure when using the headset to talk, if it is detected that the user is talking, the active noise reduction ANC function is automatically turned off, showing a natural background noise state, so that the user can effectively control the volume; Further, it can automatically determine whether to activate the environmental noise reduction ENC function according to the current environmental noise, so as to provide a relatively high-quality voice signal for the other party of the call and improve the quality of the call.
- Each module in the above-mentioned noise reduction device may be implemented in whole or in part by software, hardware and combinations thereof.
- the above modules can be embedded in or independent of the processor in the computer device in the form of hardware, or stored in the memory in the computer device in the form of software, so that the processor can call and execute the operations corresponding to the above modules.
- an audio device is provided, and the audio device may be an earphone, and the internal structure diagram thereof may be as shown in FIG. 5 .
- the audio device includes a processor, a memory, a communication interface, a display screen, and an input device connected through a system bus. Among them, the processor of the audio device is used to provide computing and control capabilities.
- the memory of the audio device includes a non-volatile storage medium and an internal memory.
- the nonvolatile storage medium stores an operating system and a computer program.
- the internal memory provides an environment for the execution of the operating system and computer programs in the non-volatile storage medium.
- the communication interface of the audio device is used for wired or wireless communication with an external terminal, and the wireless communication can be realized by WIFI, operator network, near field communication (NFC), Bluetooth or other technologies.
- the computer program when executed by a processor implements a noise reduction method.
- the display screen of the audio equipment may be a liquid crystal display screen or an electronic ink display screen, and the input device of the audio equipment may be a touch layer covered on the display screen, or a button, a trackball or a touchpad set on the casing of the audio equipment , or an external keyboard, trackpad, or mouse.
- FIG. 5 is only a block diagram of a partial structure related to the solution of the present disclosure, and does not constitute a limitation on the audio device to which the solution of the present disclosure is applied.
- the specific audio device may be Include more or fewer components than shown in the figures, or combine certain components, or have a different arrangement of components.
- the noise reduction apparatus provided by the present disclosure may be implemented in the form of a computer program, and the computer program may run on the audio device as shown in FIG. 5 .
- the memory of the audio device may store various program modules that constitute the noise reduction apparatus, for example, the first control module 110, the acquisition module 120, and the second control module 130 shown in FIG. 4 .
- the computer program constituted by each program module causes the processor to execute the steps in the noise reduction method of each embodiment of the present disclosure described in this specification.
- the audio device shown in FIG. 5 may execute the aforementioned step S1 through the first control module 110 in the noise reduction apparatus shown in FIG. 4 .
- the audio device may perform the aforementioned step S2 through the obtaining module 120 .
- the audio device may perform the aforementioned step S3 through the second control module 130 .
- an audio device comprising a memory and a processor, the memory stores a computer program, and the processor implements the following steps when executing the computer program:
- Step S1 When the headset is in the active noise reduction mode, if it is detected that the user is talking, the headset is controlled to exit the active noise reduction mode.
- Step S2 Acquire the sound energy and ambient noise energy of the user during the call.
- Step S3 If it is determined according to the sound energy and the environmental noise energy that the conditions for enabling environmental noise reduction are met, control the earphone to start the environmental noise reduction mode.
- the processor further implements the following steps when executing the computer program:
- Step S11 Determine the user as the target user.
- Step S12 If the volume of the target user during the call is higher than the preset volume threshold, it is determined that the user is on a call.
- the processor further implements the following steps when executing the computer program:
- Step S111 Identify the voiceprint feature of the user during the call.
- Step S112 Match the voiceprint feature with a preset reference voiceprint corresponding to the target user.
- Step S113 If the matching degree is higher than the preset matching degree threshold, determine the user as the target user.
- the processor when the processor executes the computer program, the processor further implements the following steps: if the ratio of the user's sound energy to the ambient noise energy is greater than a preset signal-to-noise ratio threshold, it is determined that a condition for enabling ambient noise reduction is satisfied.
- the audio device of the embodiment of the present disclosure controls the headset to exit the active noise reduction mode when it is detected that the user is using the headset to talk, that is, the active noise reduction function is automatically turned off during the call, so as to provide the user with background noise as a reference, which is beneficial to the user. It can control the volume level to avoid the discomfort caused by the surrounding people or the other party due to the loud sound.
- the headset is controlled to enable the ambient noise reduction function, that is, adaptive ambient noise reduction is realized, thereby avoiding the impact of ambient noise on the call sound. Interference, which is beneficial to improve the high-quality voice signal to the other party of the call, thereby improving the quality of the call.
- a computer-readable storage medium on which a computer program is stored, and when the computer program is executed by a processor, the following steps are implemented:
- Step S1 When the headset is in the active noise reduction mode, if it is detected that the user is talking, the headset is controlled to exit the active noise reduction mode.
- Step S2 Acquire the sound energy and ambient noise energy of the user during the call.
- Step S3 If it is determined according to the sound energy and the environmental noise energy that the conditions for enabling environmental noise reduction are met, control the earphone to start the environmental noise reduction mode.
- the computer program further implements the following steps when executed by the processor:
- Step S11 Determine the user as the target user.
- Step S12 If the volume of the target user during the call is higher than the preset volume threshold, it is determined that the user is on a call.
- the computer program further implements the following steps when executed by the processor:
- Step S111 Identify the voiceprint feature of the user during the call.
- Step S112 Match the voiceprint feature with a preset reference voiceprint corresponding to the target user.
- Step S113 If the matching degree is higher than the preset matching degree threshold, determine the user as the target user.
- the following steps are further implemented: if the ratio of the user's sound energy to the ambient noise energy is greater than a preset signal-to-noise ratio threshold, it is determined that a condition for enabling ambient noise reduction is satisfied.
- the computer-readable storage medium of the embodiment of the present disclosure controls the headset to exit the active noise reduction mode when it is detected that the user is using the headset to talk, that is, the active noise reduction function is automatically turned off during the call, so as to provide the user with background noise as a reference, which is beneficial to The user can better control the volume and avoid the discomfort caused by the loudness of the surrounding people or the other party in the call.
- the headset is controlled to enable the ambient noise reduction function, that is, adaptive ambient noise reduction is realized, thereby avoiding the impact of ambient noise on the call sound. Interference, which is beneficial to improve the high-quality voice signal to the other party of the call, thereby improving the quality of the call.
- any reference to memory, database or other media used in the various embodiments provided by the present disclosure may include at least one of non-volatile and volatile memory.
- Non-volatile memory may include read-only memory (Read-Only Memory, ROM), magnetic tape, floppy disk, flash memory, or optical memory, and the like.
- Volatile memory may include random access memory (RAM) or external cache memory.
- RAM is available in various forms, such as Static Random Access Memory (SRAM) and Dynamic Random Access Memory (DRAM).
- the noise reduction method provided by the present disclosure enables the user to control the headset to exit the active noise reduction mode and activate the environmental noise reduction function to avoid the interference of the environmental noise on the call sound when the user uses the headset to make a call, which is beneficial to provide high-quality voice to the other party of the call. signal, improve the call quality, and has strong industrial practicability.
Landscapes
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Telephone Function (AREA)
Abstract
The present disclosure relates to the field of audio processing. Provided are a noise cancellation method and apparatus, and an audio device and a computer-readable storage medium. The noise cancellation method comprises: when an earphone is in an active noise cancellation mode, if it is detected that a user is on a call, controlling the earphone to exit the active noise cancellation mode; acquiring sound energy of the user and ambient noise energy during a call process; and if it is determined, according to the sound energy and the ambient noise energy, that an ambient noise cancellation starting condition is met, controlling the earphone to start an ambient noise cancellation mode. By using the method, an active noise cancellation function can be automatically turned off during a call, and ambient noise cancellation is performed in an adaptive manner, such that the call quality is improved.
Description
本公开要求于2021年1月18日提交中国专利局、申请号为202110065090.2、发明名称为“降噪方法、装置、音频设备和计算机可读存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本公开中。This disclosure claims the priority of the Chinese patent application with the application number 202110065090.2 and the invention titled "Noise Reduction Method, Apparatus, Audio Equipment, and Computer-readable Storage Medium" filed with the China Patent Office on January 18, 2021, the entire contents of which are Incorporated in this disclosure by reference.
本公开涉及音频处理技术领域,特别是涉及一种降噪方法、装置、音频设备和计算机可读存储介质。The present disclosure relates to the technical field of audio processing, and in particular, to a noise reduction method, an apparatus, an audio device, and a computer-readable storage medium.
目前的很多音频设备,如耳机,为了满足用户的沉浸式体验,加入了降噪功能,如主动降噪(Active Noise Cancellation,ANC)功能,从而使用户在使用过程中,减少外界噪音的干扰。Many current audio devices, such as headphones, add noise reduction functions, such as Active Noise Cancellation (ANC), in order to satisfy users' immersive experience, so that users can reduce the interference of external noise during use.
然而,在追求沉浸式体验的同时,如果用户使用耳机通话,而耳机没有自动去掉主动降噪功能,则用户在通话时就没有背景底噪声音作为参考,发声大小就无法很好的控制,可能会由于声音过大而影响周围的人,也会让通话对方感到很突兀,引起对方不适。并且,由于环境噪声的存在,可能会对通话声音造成干扰,导致通话对方无法接收到优质的语音信号,从而降低通话质量。However, while pursuing an immersive experience, if the user uses the headset to talk, and the headset does not automatically remove the active noise reduction function, the user will not have background noise as a reference during the call, and the volume of the sound cannot be well controlled. It will affect the surrounding people due to the loud sound, and it will also make the other party feel awkward and cause discomfort to the other party. In addition, due to the existence of environmental noise, it may interfere with the voice of the call, so that the other party of the call cannot receive high-quality voice signals, thereby reducing the quality of the call.
发明内容SUMMARY OF THE INVENTION
(一)要解决的技术问题(1) Technical problems to be solved
现有技术中用户在使用具有主动降噪功能的耳机进行通话时,环境噪声的存在可能会对通话声音造成干扰,导致通话对方无法接收到优质的语音信号,降低了通话质量。In the prior art, when a user uses a headset with an active noise reduction function to make a call, the existence of environmental noise may interfere with the call sound, so that the other party of the call cannot receive high-quality voice signals, which reduces the call quality.
(二)技术方案(2) Technical solutions
基于此,有必要针对上述技术问题,提供一种能够在通话时自动 关闭主动降噪功能,且自适应进行环境降噪,从而提高通话质量的降噪方法、装置、音频设备和计算机可读存储介质。Based on this, it is necessary to provide a noise reduction method, device, audio device and computer-readable storage that can automatically turn off the active noise reduction function during a call and adaptively perform environmental noise reduction, thereby improving the quality of the call. medium.
本公开实施例提供了一种降噪方法,应用于耳机,所述方法包括:An embodiment of the present disclosure provides a noise reduction method, which is applied to an earphone, and the method includes:
所述耳机处于主动降噪模式时,若检测到用户正在通话,则控制所述耳机退出所述主动降噪模式;When the headset is in the active noise reduction mode, if it is detected that the user is talking, controlling the headset to exit the active noise reduction mode;
获取通话过程中所述用户的声音能量和环境噪音能量;Obtain the voice energy and ambient noise energy of the user during the call;
若根据所述声音能量和环境噪音能量确定满足启动环境降噪条件,则控制所述耳机启动环境降噪模式。If it is determined according to the sound energy and the ambient noise energy that the conditions for enabling ambient noise reduction are met, the earphone is controlled to enable the ambient noise reduction mode.
在一个实施例中,所述检测到用户正在通话的过程,包括:In one embodiment, the process of detecting that the user is talking includes:
确定所述用户为目标用户;determining that the user is a target user;
所述目标用户在通话过程中的音量高于预设的音量阈值,则判定检测到所述用户正在通话。If the volume of the target user during the call is higher than a preset volume threshold, it is determined that the user is on a call.
在一个实施例中,所述确定所述用户为目标用户的过程,包括:In one embodiment, the process of determining that the user is a target user includes:
识别所述用户在通话过程中的声纹特征;Identify the voiceprint feature of the user during the call;
将所述声纹特征与预设的对应于所述目标用户的基准声纹进行匹配;matching the voiceprint feature with a preset reference voiceprint corresponding to the target user;
若匹配度高于预设的匹配度阈值,则确定所述用户为所述目标用户。If the matching degree is higher than a preset matching degree threshold, it is determined that the user is the target user.
在一个实施例中,所述根据所述声音能量和环境噪音能量确定满足启动环境降噪条件的过程,包括:In one embodiment, the process of determining, according to the sound energy and ambient noise energy, that the conditions for enabling ambient noise reduction are met includes:
若所述用户的声音能量与所述环境噪声能量的比值大于预设的信噪比阈值,则确定满足所述启动环境降噪条件。If the ratio of the user's sound energy to the environmental noise energy is greater than a preset signal-to-noise ratio threshold, it is determined that the activation environment noise reduction condition is satisfied.
本公开实施例提供了一种降噪装置,应用于耳机,所述装置包括:An embodiment of the present disclosure provides a noise reduction device, which is applied to an earphone, and the device includes:
第一控制模块,用于在所述耳机处于主动降噪模式下,当检测到用户正在通话时,控制所述耳机退出所述主动降噪模式;a first control module, configured to control the headset to exit the active noise reduction mode when it is detected that the user is talking when the headset is in the active noise reduction mode;
获取模块,用于获取通话过程中所述用户的声音能量和环境噪音能量;an acquisition module for acquiring the user's voice energy and ambient noise energy during the call;
第二控制模块,用于根据所述声音能量和环境噪音能量确定满足启动环境降噪条件时,控制所述耳机启动环境降噪模式。The second control module is configured to control the earphone to activate the ambient noise reduction mode when it is determined according to the sound energy and the ambient noise energy that the conditions for enabling ambient noise reduction are satisfied.
本公开实施例提供了一种音频设备,包括存储器和处理器,所述 存储器存储有计算机程序,所述处理器执行所述计算机程序时实现本公开任意实施例所提供的降噪方法的步骤。An embodiment of the present disclosure provides an audio device, including a memory and a processor, where the memory stores a computer program, and the processor implements the steps of the noise reduction method provided by any embodiment of the present disclosure when the processor executes the computer program.
本公开实施例提供了一种计算机可读存储介质,其上存储有计算机程序,所述计算机程序被处理器执行时实现本公开任意实施例所提供的降噪方法的步骤。An embodiment of the present disclosure provides a computer-readable storage medium on which a computer program is stored, and when the computer program is executed by a processor, implements the steps of the noise reduction method provided by any embodiment of the present disclosure.
(三)有益效果(3) Beneficial effects
本公开实施例提供的技术方案与现有技术相比具有如下优点:Compared with the prior art, the technical solutions provided by the embodiments of the present disclosure have the following advantages:
本公开实施例所提供的降噪方法、装置、音频设备和计算机可读存储介质,检测到用户正在使用耳机通话时,控制耳机退出主动降噪模式,即在通话时自动关闭主动降噪功能,从而为用户提供背景底噪作为参考,利于用户更好的控制音量大小,避免由于声音过大而引起周围人或通话对方的不适感。另外,根据获取的通话过程中用户的声音能量和环境噪音能量确定满足启动环境降噪条件时,控制耳机启动环境降噪功能,即实现自适应环境降噪,从而可避免环境噪声对通话声音的干扰,利于向通话对方提高优质的语音信号,从而提高了通话质量。The noise reduction method, device, audio device, and computer-readable storage medium provided by the embodiments of the present disclosure control the headset to exit the active noise reduction mode when it is detected that the user is using the headset to talk, that is, the active noise reduction function is automatically turned off during the call, Therefore, the background noise is provided for the user as a reference, which is beneficial for the user to better control the volume, and avoid discomfort caused by the surrounding people or the other party of the call due to the loud sound. In addition, according to the acquired sound energy and ambient noise energy of the user during the call, when it is determined that the conditions for enabling ambient noise reduction are met, the headset is controlled to enable the ambient noise reduction function, that is, adaptive ambient noise reduction is realized, thereby avoiding the impact of ambient noise on the call sound. Interference, which is beneficial to improve the high-quality voice signal to the other party of the call, thereby improving the quality of the call.
图1为一个实施例中降噪方法的流程示意图;1 is a schematic flowchart of a noise reduction method in one embodiment;
图2为一个实施例中检测用户正在通话的过程的流程示意图;2 is a schematic flowchart of a process of detecting that a user is talking in an embodiment;
图3为一个实施例中确定用户为目标用户的过程的流程示意图;3 is a schematic flowchart of a process of determining a user as a target user in one embodiment;
图4为一个实施例中降噪装置的结构框图;4 is a structural block diagram of a noise reduction device in one embodiment;
图5为一个实施例中音频设备的内部结构图。FIG. 5 is an internal structure diagram of an audio device in one embodiment.
为了使本公开的目的、技术方案及优点更加清楚明白,以下结合附图及实施例,对本公开进行进一步详细说明。应当理解,此处描述的具体实施例仅仅用以解释本公开,并不用于限定本公开。In order to make the objectives, technical solutions and advantages of the present disclosure more clear, the present disclosure will be further described in detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are only used to explain the present disclosure, but not to limit the present disclosure.
<实施例1><Example 1>
在一个实施例中,如图1所示,提供了一种降噪方法。本实施例主要以该降噪方法应用于音频设备,如耳机来举例说明。可以理解的 是,该耳机可以为有线耳机,也可以为采用无线通信的耳机,如蓝牙耳机等。In one embodiment, as shown in FIG. 1, a noise reduction method is provided. In this embodiment, the noise reduction method is mainly applied to an audio device, such as an earphone, for illustration. It can be understood that the earphone can be a wired earphone or a wireless communication earphone, such as a bluetooth earphone.
如图1所示,该降噪方法包括以下步骤:As shown in Figure 1, the noise reduction method includes the following steps:
步骤S1:耳机处于主动降噪模式时,若检测到用户正在通话,则控制耳机退出主动降噪模式。Step S1: When the headset is in the active noise reduction mode, if it is detected that the user is talking, the headset is controlled to exit the active noise reduction mode.
具体的,耳机处于主动降噪模式,即耳机开启了主动降噪功能,此时,耳机会模拟出一种和噪音一样的频率,且反相位的声波,可部分或全部抵消噪音,从而可有效减少外界噪音的干扰,为用户提供一种沉浸式体验。例如,在用户听音乐或看视频时,耳机处于主动降噪模式,从而为用户提供良好的音效体验。Specifically, the earphone is in the active noise reduction mode, that is, the active noise reduction function of the earphone is turned on. At this time, the earphone will simulate a sound wave of the same frequency as the noise, and the anti-phase sound wave can partially or completely cancel the noise, so that the noise can be partially or completely canceled. Effectively reduce the interference of external noise and provide users with an immersive experience. For example, when a user is listening to music or watching a video, the headset is in active noise cancellation mode, thereby providing the user with a good sound experience.
可以理解的是,人在说话时,基于背景底噪声音,可清晰感知自身发出的音量大小,从而可有效控制说话音量,发出比较均衡的声音大小。在采用耳机通话时,若继续开启主动降噪功能,则由于主动降噪功能大大降低了背景底噪声音,使得用户无法清晰感知自身发出的音量大小,进而不能有效控制说话音量,可能会由于通话声音过大而影响周围的人,也会给通话对方带来不适感,或者可能会由于通话声音过小而导致通话对方听不清楚,进而影响通话进程和通话感受。因而,在本公开实施例中,检测用户是否正在通话,若用户正在通话,则控制耳机退出主动降噪模式,即关闭主动降噪功能,不再降低背景底噪声音,呈现出自然噪音状态,使得用户能够清晰感知自身发出的音量大小,为用户说话音量提供对比参考,进而有效控制说话音量,避免由于通话声音过大而给周围人和通话对方带来的不适感,或者由于通话声音过小而导致通话对方听不清楚,进而影响通话进程和通话感受的情况,从而保证用户通话声音的均衡性。若检测到用户未在通话,则不执行相应动作,即耳机不退出主动降噪模式,继续开启主动降噪功能。It is understandable that when people speak, based on the background noise, they can clearly perceive the volume they emit, so that they can effectively control the volume of speech and emit a relatively balanced voice. When using the headset to talk, if you continue to turn on the active noise reduction function, the active noise reduction function greatly reduces the background noise, so that the user cannot clearly perceive the volume emitted by himself, and thus cannot effectively control the speaking volume. If the sound is too loud and affects the people around you, it will also bring discomfort to the caller, or the caller may not be able to hear clearly due to the low volume of the call, thereby affecting the call process and call experience. Therefore, in the embodiment of the present disclosure, it is detected whether the user is talking, and if the user is talking, the headset is controlled to exit the active noise reduction mode, that is, the active noise reduction function is turned off, the background noise is no longer reduced, and a natural noise state is presented. It enables users to clearly perceive the volume of their own voice, provides a reference for the comparison of the user's speaking volume, and then effectively controls the speaking volume, avoiding the discomfort caused to the surrounding people and the other party due to the excessively loud voice of the call, or because the voice of the call is too low As a result, the other party of the call cannot hear clearly, which in turn affects the call process and the call experience, thereby ensuring the balance of the user's call sound. If it is detected that the user is not talking, the corresponding action is not performed, that is, the headset does not exit the active noise reduction mode, and the active noise reduction function continues to be turned on.
步骤S2:获取通话过程中用户的声音能量和用户所处环境的环境噪音能量。Step S2: Acquire the sound energy of the user and the ambient noise energy of the environment where the user is located during the call.
具体的,可通过麦克风等音频采集处理装置采集通话过程中的用户声音和用户所处环境的环境噪音,分析得到各自对应的音量大小。 具体的,可分析得到用户声音对应的声波能量,即用户的声音能量,以及用户所处环境的环境噪音对应的声波能量,即环境噪音能量。Specifically, the user's voice during the call and the environmental noise of the environment where the user is located can be collected through an audio collection and processing device such as a microphone, and the corresponding volume levels can be obtained through analysis. Specifically, the sound wave energy corresponding to the user's voice, that is, the user's sound energy, and the sound wave energy corresponding to the ambient noise in the environment where the user is located, that is, the ambient noise energy, can be obtained through analysis.
步骤S3:若根据声音能量和环境噪音能量确定满足启动环境降噪条件,则控制耳机启动环境降噪模式。Step S3: If it is determined according to the sound energy and the environmental noise energy that the conditions for enabling environmental noise reduction are met, control the earphone to start the environmental noise reduction mode.
具体的,耳机启动环境降噪(Environmental Noise Cancellation,ENC)模式,即耳机可通过双麦克风阵列,精准计算用户说话的方位,在保护主方向目标语音的同时,去除环境中的各种干扰噪声。Specifically, the headset activates the Environmental Noise Cancellation (ENC) mode, that is, the headset can accurately calculate the direction of the user's speech through the dual-microphone array, and remove various interfering noises in the environment while protecting the target voice in the main direction.
可以理解的是,在用户通过耳机通话时,环境中存在多种噪声,可能会对通话声音造成干扰,导致通话对方无法接收到优质的语音信号,从而降低通话质量。因而,在本公开的实施例中,在用户通过耳机通话时,根据上述步骤S2中得到的声音能量和环境噪音能量判断是否满足启动环境降噪条件,若满足启动环境降噪条件,则控制耳机启动环境降噪模式,从而对环境中的多种噪声进行抑制或消除,实现自适应环境降噪,提通话过程中的环境噪声抗干扰性,确保为通话对方提供优质语音信号,从而提高通话质量。若不满足启动环境降噪条件,则不执行相应动作,即不启动环境降噪模式。It is understandable that when the user is talking through the headset, there are various noises in the environment, which may interfere with the voice of the call, so that the other party of the call cannot receive high-quality voice signals, thereby reducing the quality of the call. Therefore, in the embodiment of the present disclosure, when the user talks through the headset, it is judged whether the conditions for enabling environmental noise reduction are satisfied according to the sound energy and ambient noise energy obtained in the above step S2, and if the conditions for enabling environmental noise reduction are satisfied, the headset is controlled Activate the environmental noise reduction mode to suppress or eliminate various noises in the environment, realize adaptive environmental noise reduction, improve the anti-interference of environmental noise during the call, and ensure that high-quality voice signals are provided for the caller, thereby improving call quality. . If the conditions for enabling ambient noise reduction are not met, the corresponding action is not performed, that is, the ambient noise reduction mode is not activated.
上述降噪方法中,检测到用户正在使用耳机通话时,控制耳机退出主动降噪模式,即在通话时自动关闭主动降噪功能,从而为用户提供背景底噪作为参考,利于用户更好的控制音量大小,避免由于声音过大而引起周围人或通话对方的不适感。另外,根据获取的通话过程中用户的声音能量和环境噪音能量确定满足启动环境降噪条件时,控制耳机启动环境降噪功能,即实现自适应环境降噪,从而可避免环境噪声对通话声音的干扰,利于向通话对方提高优质的语音信号,从而提高了通话质量。In the above noise reduction method, when it is detected that the user is using the headset to talk, the headset is controlled to exit the active noise reduction mode, that is, the active noise reduction function is automatically turned off during the call, so as to provide the user with background noise as a reference, which is conducive to better control by the user. The volume is too loud to avoid discomfort to the people around you or the other party in the call. In addition, according to the acquired sound energy and ambient noise energy of the user during the call, when it is determined that the conditions for enabling ambient noise reduction are met, the headset is controlled to enable the ambient noise reduction function, that is, adaptive ambient noise reduction is realized, thereby avoiding the impact of ambient noise on the call sound. Interference, which is beneficial to improve the high-quality voice signal to the other party of the call, thereby improving the quality of the call.
在本公开的一个实施例中,如图2所示,步骤S1中,检测到用户正在通话的过程,包括以下步骤:In an embodiment of the present disclosure, as shown in FIG. 2 , in step S1, the process of detecting that the user is on a call includes the following steps:
步骤S11:确定用户为目标用户。Step S11: Determine the user as the target user.
具体的,目标用户例如为耳机所属用户,即判定当前正在通话的用户为耳机所属用户,从而避免发生误检测而造成的误操作,提高了可靠性。例如检测到其他正在通话的人时,控制耳机退出主动降噪模 式,造成误操作。Specifically, the target user is, for example, the user to which the headset belongs, that is, it is determined that the user currently talking is the user to which the headset belongs, so as to avoid misoperation caused by false detection and improve reliability. For example, when it detects other people who are talking, control the headset to exit the active noise reduction mode, resulting in misoperation.
可以理解的是,在一些空间比较小的地方或人流比较密集的区域,如商城等,人数较多,且人群密度大,相互之间的间距比较小。此时,若用户并未通话,而是在听音乐,则此时耳机处于主动降噪模式,若用户周围有人正在通话,由于彼此间距较小,此时耳机可能识别到通话声音,误认为是用户正在通话,则可能会退出主动降噪模式,造成误操作,而事实上,用户正在听音乐,此时退出主动降噪模式,会极大影响用户的使用感受,降低用户使用体验。因而,本公开实施例中,首选确定正在通话的用户为目标用户,即耳机所属用户,从而可防止误识别、误操作导致的影响用户使用感受和使用体验的情况,提高了可靠性。It is understandable that in some places with relatively small space or areas with relatively dense crowds, such as shopping malls, there are many people, and the crowd density is large, and the distance between them is relatively small. At this time, if the user is not talking, but listening to music, the headset is in active noise reduction mode. If someone is talking around the user, the headset may recognize the voice of the call and mistakenly think it is due to the small distance between them. When the user is on a call, he may exit the active noise reduction mode, causing misoperation. In fact, the user is listening to music, and exiting the active noise reduction mode at this time will greatly affect the user's experience and reduce the user's experience. Therefore, in the embodiment of the present disclosure, it is preferred to determine the user who is talking as the target user, that is, the user to which the headset belongs, so as to prevent misidentification and misoperation from affecting the user experience and experience, and improve reliability.
换言之,步骤S11,即确定正在通话的人是用户本人,而不是其他人。In other words, in step S11, it is determined that the person who is talking is the user himself, not other people.
步骤S12:目标用户在通话过程中的音量高于预设的音量阈值,则判定检测到用户正在通话。Step S12: If the volume of the target user during the call is higher than the preset volume threshold, it is determined that the user is on a call.
具体的,在确定正在通话的用户为目标用户后,还需要确定目标用户在通话过程中的声音大小是否满足一定条件,即判断目标用户在通话过程中的音量是否高于预设的音量阈值,若是,则判定目标用户正在通话,从而,进一步避免误识别和误操作,提高了可靠性。若否,即目标用户在通话过程中的音量小于或等于预设的音量阈值时,判定目标用户未通话,则不执行任何操作,耳机继续保持主动降噪模式。Specifically, after it is determined that the user who is on the call is the target user, it is also necessary to determine whether the volume of the target user's voice during the call satisfies a certain condition, that is, to determine whether the volume of the target user during the call is higher than the preset volume threshold. If so, it is determined that the target user is talking, thereby further avoiding misidentification and misoperation, and improving reliability. If not, that is, when the target user's volume during the call is less than or equal to the preset volume threshold, it is determined that the target user is not talking, no operation is performed, and the headset continues to maintain the active noise reduction mode.
可以理解的是,正常通话过程中,用户的说话声音会达到一定程度,如达到预设的音量阈值,利于通话对方能够清晰接收到声音信号。而若用户声音较小,则用户很可能不是在通话,例如用户在和周围人小声交谈,或者用户当前正在通过耳机听音乐,并跟随音乐轻声跟唱,若此时将该音量较小的声音误识别为通话声音,则会造成误操作,即耳机退出主动降噪模式,从而影响用户的使用感受和使用体验。因而,在本公开的实施例中,在确定目标用户后,还需进一步判断用户的音量是否高于预设的音量阈值,当确定用户的音量高于预设的音量阈值后,判定用户正在通话,此时,耳机退出主动降噪模式,从而进一步 避免误识别和误操作,避免影响用户使用感受和使用体验,提高了可靠性。It is understandable that during a normal call, the user's speaking voice will reach a certain level, such as reaching a preset volume threshold, which is beneficial for the other party to receive the sound signal clearly. However, if the user's voice is low, the user is probably not talking. For example, the user is talking with the surrounding people in a low voice, or the user is currently listening to music through headphones and sings along with the music softly. If it is mistakenly recognized as a call sound, it will cause a misoperation, that is, the headset will exit the active noise reduction mode, which will affect the user's experience and experience. Therefore, in the embodiment of the present disclosure, after determining the target user, it is necessary to further determine whether the user's volume is higher than the preset volume threshold, and after determining that the user's volume is higher than the preset volume threshold, determine that the user is talking , at this time, the headset exits the active noise reduction mode, thereby further avoiding misidentification and misoperation, avoiding affecting the user's experience and experience, and improving reliability.
在具体示例中,预设的音量阈值例如为预先设置的音量阈值,其取值依据例如为尽量准确判定目标用户是否正在通话。在具体实施例中,预设的音量阈值例如为40分贝,也即在识别到目标用户通话过程中的音量大于40分贝时,判定该用户正在通话。否则,认为用户未在通话,耳机继续保持主动降噪模式。In a specific example, the preset volume threshold is, for example, a preset volume threshold, which is based on, for example, determining as accurately as possible whether the target user is on a call. In a specific embodiment, the preset volume threshold is, for example, 40 decibels, that is, when it is recognized that the volume of the target user during the call is greater than 40 decibels, it is determined that the user is talking. Otherwise, it is considered that the user is not talking, and the headset continues to maintain the active noise reduction mode.
在本公开的一个实施例中,如图3所示,步骤S11中,确定用户为目标用户的过程,包括以下步骤:In an embodiment of the present disclosure, as shown in FIG. 3 , in step S11, the process of determining the user as the target user includes the following steps:
步骤S111:识别用户在通话过程中的声纹特征。Step S111: Identify the voiceprint feature of the user during the call.
步骤S112:将声纹特征与预设的对应于目标用户的基准声纹进行匹配。Step S112: Match the voiceprint feature with a preset reference voiceprint corresponding to the target user.
具体的,例如在耳机中预先存储有目标用户,即耳机所属用户的基准声纹,进而,将通话过程中实际采集到的用户的声纹特征与预存的目标用户的基准声纹特征进行匹配,利于准确确定该用户是否为目标用户,即是否为耳机所属用户。Specifically, for example, the target user, that is, the reference voiceprint of the user to which the headset belongs is pre-stored in the headset, and then the user's voiceprint feature actually collected during the call is matched with the pre-stored reference voiceprint feature of the target user, It is beneficial to accurately determine whether the user is the target user, that is, whether the user belongs to the headset.
步骤S113:若匹配度高于预设的匹配度阈值,则确定用户为目标用户。Step S113: If the matching degree is higher than the preset matching degree threshold, determine the user as the target user.
具体的,即通话过程中实际采集到的用户的声纹特征与预存的目标用户的基准声纹特征的匹配度高于预设的匹配度阈值,则确定该用户为目标用户,即耳机所属用户,从而可准确识别用户身份,确定目标用户,防止误识别。Specifically, that is, if the matching degree between the voiceprint feature of the user actually collected during the call and the pre-stored target user's reference voiceprint feature is higher than the preset matching degree threshold, the user is determined to be the target user, that is, the user to which the headset belongs. , so that the user identity can be accurately identified, the target user can be determined, and misidentification can be prevented.
在具体实施例中,预设的匹配度阈值例如为预先设定的值,其取值依据例如为尽量准确判定正在通话的用户是否为目标用户。可以理解的是,预设的匹配度阈值取值不宜过高,也不宜过低,取值太高可能由于环境扰动,不能识别出目标用户,出现弃真误差;取值太低,又可能将不是目标用户的其他用户的声音识别为目标用户,从而出现纳伪的误差。In a specific embodiment, the preset matching degree threshold is, for example, a preset value, and its value is based on, for example, determining as accurately as possible whether the user who is talking is the target user. It is understandable that the preset matching degree threshold value should not be too high or too low. If the value is too high, the target user may not be recognized due to environmental disturbances, and there will be an error of abandonment; if the value is too low, it may The voices of other users who are not the target user are recognized as the target user, resulting in a false acceptance error.
在具体实施例中,预设的匹配度阈值例如为95%。In a specific embodiment, the preset matching degree threshold is, for example, 95%.
在具体实施例中,检测用户是否正在通话的过程例如包括:首先, 识别通话用户为目标用户,而不是其他用户。例如,在耳机中预先记录有目标用户的声纹特征,即基准声纹,将通话过程中实际采集的用户的声纹特征与该基准声纹进行对比,若对比结果达到一定的置信度水平,如95%,则确定正在通话的该用户即为目标用户,也即耳机所属用户,从而有效避免误识别,提高目标用户的识别精度。进一步地,识别目标用户的音量是否高于预设的音量阈值,如40分贝,若是,则确定用户正在打电话,此时,耳机退出主动降噪模式。否则,认为用户未在通话,耳机继续保持主动降噪模式。In a specific embodiment, the process of detecting whether the user is on a call includes, for example: first, identifying the calling user as a target user, not other users. For example, the voiceprint feature of the target user, that is, the reference voiceprint, is pre-recorded in the headset, and the user's voiceprint feature actually collected during the call is compared with the reference voiceprint. If the comparison result reaches a certain confidence level, If it is 95%, it is determined that the user who is talking is the target user, that is, the user to which the headset belongs, thereby effectively avoiding misrecognition and improving the recognition accuracy of the target user. Further, it is identified whether the volume of the target user is higher than a preset volume threshold, such as 40 decibels, and if so, it is determined that the user is making a call, and at this time, the headset exits the active noise reduction mode. Otherwise, it is considered that the user is not talking, and the headset continues to maintain the active noise reduction mode.
在本公开的一个实施例中,步骤S3中,根据声音能量和环境噪音能量确定满足启动环境降噪条件的过程,包括:若用户的声音能量与环境噪声能量的比值大于预设的信噪比阈值,则确定满足启动环境降噪条件,则耳机启动环境降噪模式。否则,即用户的声音能量与环境噪声能量的比值小于或等于预设的信噪比阈值,则确定不满足启动环境降噪条件,则耳机不启动环境降噪模式。In an embodiment of the present disclosure, in step S3, determining a process that satisfies the conditions for enabling environmental noise reduction according to the sound energy and the ambient noise energy includes: if the ratio of the user's sound energy to the ambient noise energy is greater than a preset signal-to-noise ratio If the threshold is set, it is determined that the conditions for enabling ambient noise reduction are met, and the headset starts the ambient noise reduction mode. Otherwise, that is, the ratio of the user's sound energy to the ambient noise energy is less than or equal to the preset signal-to-noise ratio threshold, it is determined that the conditions for enabling ambient noise reduction are not met, and the headset does not enable the ambient noise reduction mode.
在具体实施例中,预设的信噪比阈值为预先设置的值,其取值依据例如为根据声音能量和环境噪声能量之间的关系尽量准确判定耳机是否满足开启环境降噪条件。在具体示例中,预设的信噪比阈值例如为大于1的值,例如1.5。In a specific embodiment, the preset signal-to-noise ratio threshold is a preset value, which is based on, for example, determining as accurately as possible whether the earphone satisfies the ambient noise reduction condition according to the relationship between sound energy and ambient noise energy. In a specific example, the preset signal-to-noise ratio threshold is, for example, a value greater than 1, such as 1.5.
具体而言,在得到用户的声音能量和环境噪声能量后,可根据声音能量和环境噪声能量设置一个信噪比阈值,表示为:声音能量/环境噪声能量,即声音能量与环境噪声能量的比值。可以理解的是,环境降噪过程对有用声波信号具有自损性,所以当用户的声音能量小于或等于环境噪声能量时,一般不开启环境降噪模式,否则用户的声音会被消除。换言之,该预设的信噪比阈值一般设置为大于1的值,例如1.5。也即是说,当检测到实际信噪比阈值,即采集的声音能量与环境噪声能量的比值大于预设的信噪比阈值,例如1.5时,则自动启动环境降噪模式,以便降低环境噪声的干扰,从而为通话对方提供清晰、优质的语音信号,提高通话质量。Specifically, after obtaining the user's sound energy and environmental noise energy, a signal-to-noise ratio threshold can be set according to the sound energy and environmental noise energy, expressed as: sound energy/environmental noise energy, that is, the ratio of sound energy to environmental noise energy . It is understandable that the environmental noise reduction process is self-destructive to the useful sound wave signal, so when the user's sound energy is less than or equal to the environmental noise energy, the environmental noise reduction mode is generally not turned on, otherwise the user's voice will be eliminated. In other words, the preset signal-to-noise ratio threshold is generally set to a value greater than 1, such as 1.5. That is to say, when the actual signal-to-noise ratio threshold is detected, that is, the ratio of the collected sound energy to the ambient noise energy is greater than the preset signal-to-noise ratio threshold, such as 1.5, the ambient noise reduction mode is automatically activated to reduce ambient noise. Therefore, it can provide clear and high-quality voice signals for the other party and improve the quality of the call.
从而,本公开实施例的降噪方法,在使用耳机通话时,若检测到用户自己在讲话,则自动关闭主动降噪ANC功能,呈现出自然的背景 底噪状态,以便用户有效控制音量;进一步地,能根据目前的环境噪音,自动判断是否启动环境降噪ENC功能,从而利于为通话对方能提供相对优质的语音信号,提高通话质量。Therefore, in the noise reduction method according to the embodiment of the present disclosure, when it is detected that the user is speaking by himself, the active noise reduction ANC function is automatically turned off to present a natural background noise state, so that the user can effectively control the volume; further It can automatically determine whether to activate the environmental noise reduction ENC function according to the current environmental noise, so as to provide a relatively high-quality voice signal for the caller and improve the call quality.
应该理解的是,虽然图1-3的流程图中的各个步骤按照箭头的指示依次显示,但是这些步骤并不是必然按照箭头指示的顺序依次执行。除非本文中有明确的说明,这些步骤的执行并没有严格的顺序限制,这些步骤可以以其它的顺序执行。而且,图1-3中的至少一部分步骤可以包括多个子步骤或者多个阶段,这些子步骤或者阶段并不必然是在同一时刻执行完成,而是可以在不同的时刻执行,这些子步骤或者阶段的执行顺序也不必然是依次进行,而是可以与其它步骤或者其它步骤的子步骤或者阶段的至少一部分轮流或者交替地执行。It should be understood that although the steps in the flowcharts of FIGS. 1-3 are shown in sequence according to the arrows, these steps are not necessarily executed in the sequence shown by the arrows. Unless explicitly stated herein, the execution of these steps is not strictly limited to the order, and these steps may be performed in other orders. Moreover, at least a part of the steps in FIGS. 1-3 may include multiple sub-steps or multiple stages. These sub-steps or stages are not necessarily executed and completed at the same time, but may be executed at different times. These sub-steps or stages are not necessarily completed at the same time. The order of execution of the steps is not necessarily sequential, but may be performed alternately or alternately with other steps or at least a part of sub-steps or stages of other steps.
<实施例2><Example 2>
在一个实施例中,如图4所示,提供了一种降噪装置。本实施例主要以该降噪装置应用于音频设备,如耳机来举例说明。可以理解的是,该耳机可以为有线耳机,也可以为采用无线通信的耳机,如蓝牙耳机等。In one embodiment, as shown in FIG. 4, a noise reduction device is provided. In this embodiment, the noise reduction device is mainly applied to an audio device, such as an earphone, for illustration. It can be understood that the earphone may be a wired earphone, or a earphone using wireless communication, such as a Bluetooth earphone.
如图4所示,该降噪装置100,包括:第一控制模块110、获取模块120和第二控制模块130。As shown in FIG. 4 , the noise reduction device 100 includes: a first control module 110 , an acquisition module 120 and a second control module 130 .
第一控制模块110,用于在耳机处于主动降噪模式下,当检测到用户正在通话时,控制耳机退出主动降噪模式。The first control module 110 is configured to control the headset to exit the active noise reduction mode when it is detected that the user is talking when the headset is in the active noise reduction mode.
具体的,耳机处于主动降噪模式,即耳机开启了主动降噪功能,此时,耳机会模拟出一种和噪音一样的频率,且反相位的声波,可部分或全部抵消噪音,从而可有效减少外界噪音的干扰,为用户提供一种沉浸式体验。例如,在用户听音乐或看视频时,耳机处于主动降噪模式,从而为用户提供良好的音效体验。Specifically, the earphone is in the active noise reduction mode, that is, the active noise reduction function of the earphone is turned on. At this time, the earphone will simulate a sound wave of the same frequency as the noise, and the anti-phase sound wave can partially or completely cancel the noise, so that the noise can be partially or completely canceled. Effectively reduce the interference of external noise and provide users with an immersive experience. For example, when a user is listening to music or watching a video, the headset is in active noise cancellation mode, thereby providing the user with a good sound experience.
可以理解的是,人在说话时,基于背景底噪声音,可清晰感知自身发出的音量大小,从而可有效控制说话音量,发出比较均衡的声音大小。在采用耳机通话时,若继续开启主动降噪功能,则由于主动降噪功能大大降低了背景底噪声音,使得用户无法清晰感知自身发出的音量大小,进而不能有效控制说话音量,可能会由于通话声音过大而 影响周围的人,也会给通话对方带来不适感,或者可能会由于通话声音过小而导致通话对方听不清楚,进而影响通话进程和通话感受。因而,在本公开实施例中,检测用户是否正在通话,若用户正在通话,则控制耳机退出主动降噪模式,即关闭主动降噪功能,不再降低背景底噪声音,呈现出自然噪音状态,使得用户能够清晰感知自身发出的音量大小,为用户说话音量提供对比参考,进而有效控制说话音量,避免由于通话声音过大而给周围人和通话对方带来的不适感,或者由于通话声音过小而导致通话对方听不清楚,进而影响通话进程和通话感受的情况,从而保证用户通话声音的均衡性。若检测到用户未在通话,则不执行相应动作,即耳机不退出主动降噪模式,继续开启主动降噪功能。It is understandable that when people speak, based on the background noise, they can clearly perceive the volume they emit, so that they can effectively control the volume of speech and emit a relatively balanced voice. When using the headset to talk, if you continue to turn on the active noise reduction function, the active noise reduction function greatly reduces the background noise, so that the user cannot clearly perceive the volume emitted by himself, and thus cannot effectively control the speaking volume. If the sound is too loud and affects the people around you, it will also bring discomfort to the caller, or the caller may not be able to hear clearly due to the low volume of the call, thereby affecting the call process and call experience. Therefore, in the embodiment of the present disclosure, it is detected whether the user is talking, and if the user is talking, the headset is controlled to exit the active noise reduction mode, that is, the active noise reduction function is turned off, the background noise is no longer reduced, and a natural noise state is presented. It enables users to clearly perceive the volume of their own voice, provides a reference for the comparison of the user's speaking volume, and then effectively controls the speaking volume, avoiding the discomfort caused to the surrounding people and the other party due to the excessively loud voice of the call, or because the voice of the call is too low As a result, the other party of the call cannot hear clearly, which in turn affects the call process and the call experience, thereby ensuring the balance of the user's call sound. If it is detected that the user is not talking, the corresponding action is not performed, that is, the headset does not exit the active noise reduction mode, and the active noise reduction function continues to be turned on.
获取模块120,用于获取通话过程中用户的声音能量和用户所处环境的环境噪音能量。The obtaining module 120 is configured to obtain the sound energy of the user and the ambient noise energy of the environment where the user is located during the call.
具体的,获取模块120例如包括麦克风等音频采集处理装置,可通过麦克风等音频采集处理装置采集通话过程中的用户声音和用户所处环境的环境噪音,分析得到各自对应的音量大小。具体的,可分析得到用户声音对应的声波能量,即用户的声音能量,以及用户所处环境的环境噪音对应的声波能量,即环境噪音能量。Specifically, the acquisition module 120 includes, for example, an audio collection and processing device such as a microphone, which can collect the user's voice during the call and the ambient noise of the user's environment through the microphone and other audio collection and processing devices, and analyze the corresponding volume levels. Specifically, the sound wave energy corresponding to the user's voice, that is, the user's sound energy, and the sound wave energy corresponding to the ambient noise in the environment where the user is located, that is, the ambient noise energy, can be obtained through analysis.
第二控制模块130,用于根据声音能量和环境噪音能量确定满足启动环境降噪条件时,控制耳机启动环境降噪模式。The second control module 130 is configured to control the earphone to activate the ambient noise reduction mode when it is determined according to the sound energy and the ambient noise energy that the conditions for enabling ambient noise reduction are satisfied.
具体的,耳机启动环境降噪模式,即耳机可通过双麦克风阵列,精准计算用户说话的方位,在保护主方向目标语音的同时,去除环境中的各种干扰噪声。Specifically, the headset activates the environmental noise reduction mode, that is, the headset can accurately calculate the direction of the user's speech through the dual-microphone array, and remove various interfering noises in the environment while protecting the target voice in the main direction.
可以理解的是,在用户通过耳机通话时,环境中存在多种噪声,可能会对通话声音造成干扰,导致通话对方无法接收到优质的语音信号,从而降低通话质量。因而,在本公开的实施例中,在用户通过耳机通话时,根据上述获取模块120得到的声音能量和环境噪音能量,第二控制模块130判断是否满足启动环境降噪条件,若满足启动环境降噪条件,则控制耳机启动环境降噪模式,从而对环境中的多种噪声进行抑制或消除,实现自适应环境降噪,提通话过程中的环境噪声抗 干扰性,确保为通话对方提供优质语音信号,从而提高通话质量。若不满足启动环境降噪条件,则不执行相应动作,即不启动环境降噪模式。It is understandable that when the user is talking through the headset, there are various noises in the environment, which may interfere with the voice of the call, so that the other party of the call cannot receive high-quality voice signals, thereby reducing the quality of the call. Therefore, in the embodiment of the present disclosure, when the user talks through the headset, according to the sound energy and ambient noise energy obtained by the acquisition module 120, the second control module 130 determines whether the startup environment noise reduction condition is satisfied. Noise conditions, control the headset to start the environmental noise reduction mode, so as to suppress or eliminate various noises in the environment, realize adaptive environmental noise reduction, improve the anti-interference of environmental noise during the call, and ensure high-quality voice for the other party. signal to improve call quality. If the conditions for enabling ambient noise reduction are not met, the corresponding action is not performed, that is, the ambient noise reduction mode is not activated.
上述降噪装置100中,检测到用户正在使用耳机通话时,控制耳机退出主动降噪模式,即在通话时自动关闭主动降噪功能,从而为用户提供背景底噪作为参考,利于用户更好的控制音量大小,避免由于声音过大而引起周围人或通话对方的不适感。另外,根据获取的通话过程中用户的声音能量和环境噪音能量确定满足启动环境降噪条件时,控制耳机启动环境降噪功能,即实现自适应环境降噪,从而可避免环境噪声对通话声音的干扰,利于向通话对方提高优质的语音信号,从而提高了通话质量。In the above noise reduction device 100, when it is detected that the user is using the headset to talk, the headset is controlled to exit the active noise reduction mode, that is, the active noise reduction function is automatically turned off during the call, so as to provide the user with background noise as a reference, which is beneficial for the user to better. Control the volume to avoid discomfort to the people around you or the caller due to the loud sound. In addition, according to the acquired sound energy and ambient noise energy of the user during the call, when it is determined that the conditions for enabling ambient noise reduction are met, the headset is controlled to enable the ambient noise reduction function, that is, adaptive ambient noise reduction is realized, thereby avoiding the impact of ambient noise on the call sound. Interference, which is beneficial to improve the high-quality voice signal to the other party of the call, thereby improving the quality of the call.
在本公开的一个实施例中,第一控制模块110检测到用户正在通话的过程,具体包括:确定用户为目标用户,且目标用户在通话过程中的音量高于预设的音量阈值时,判定检测到用户正在通话。In an embodiment of the present disclosure, the first control module 110 detects a process in which the user is talking, specifically including: determining that the user is the target user, and when the target user's volume during the call is higher than a preset volume threshold, determining A user is detected on a call.
具体的,目标用户例如为耳机所属用户,即判定当前正在通话的用户为耳机所属用户,从而避免发生误检测而造成的误操作,提高了可靠性。例如检测到其他正在通话的人时,控制耳机退出主动降噪模式,造成误操作。Specifically, the target user is, for example, the user to which the headset belongs, that is, it is determined that the user currently talking is the user to which the headset belongs, so as to avoid misoperation caused by false detection and improve reliability. For example, when it detects other people who are talking, control the headset to exit the active noise reduction mode, causing misoperation.
可以理解的是,在一些空间比较小的地方或人流比较密集的区域,如商城等,人数较多,且人群密度大,相互之间的间距比较小。此时,若用户并未通话,而是在听音乐,则此时耳机处于主动降噪模式,若用户周围有人正在通话,由于彼此间距较小,此时耳机可能识别到通话声音,误认为是用户正在通话,则可能会退出主动降噪模式,造成误操作,而事实上,用户正在听音乐,此时退出主动降噪模式,会极大影响用户的使用感受,降低用户使用体验。因而,本公开实施例中,首选确定正在通话的用户为目标用户,即耳机所属用户,从而可防止误识别、误操作导致的影响用户使用感受和使用体验的情况,提高了可靠性。It is understandable that in some places with relatively small space or areas with relatively dense flow of people, such as shopping malls, there are many people, and the crowd density is large, and the distance between them is relatively small. At this time, if the user is not talking, but listening to music, the headset is in active noise reduction mode. If someone is talking around the user, the headset may recognize the call sound and mistakenly think it is due to the small distance between them. When the user is on a call, he may exit the active noise reduction mode, causing misoperation. In fact, the user is listening to music, and exiting the active noise reduction mode at this time will greatly affect the user's experience and reduce the user's experience. Therefore, in the embodiment of the present disclosure, it is preferred to determine the user who is talking as the target user, that is, the user to which the headset belongs, so as to prevent misidentification and misoperation from affecting the user experience and experience, and improve reliability.
进一步地,在确定正在通话的用户为目标用户后,还需要确定目标用户在通话过程中的声音大小是否满足一定条件,即判断目标用户 在通话过程中的音量是否高于预设的音量阈值,若是,则判定目标用户正在通话,从而,进一步避免误识别和误操作,提高了可靠性。若否,即目标用户在通话过程中的音量小于或等于预设的音量阈值时,判定目标用户未通话,则不执行任何操作,耳机继续保持主动降噪模式。Further, after it is determined that the user who is talking is the target user, it is also necessary to determine whether the volume of the target user during the call satisfies a certain condition, that is, to determine whether the volume of the target user during the call is higher than the preset volume threshold. If so, it is determined that the target user is talking, thereby further avoiding misidentification and misoperation, and improving reliability. If not, that is, when the target user's volume during the call is less than or equal to the preset volume threshold, it is determined that the target user is not talking, no operation is performed, and the headset continues to maintain the active noise reduction mode.
可以理解的是,正常通话过程中,用户的说话声音会达到一定程度,如达到预设的音量阈值,利于通话对方能够清晰接收到声音信号。而若用户声音较小,则用户很可能不是在通话,例如用户在和周围人小声交谈,或者用户当前正在通过耳机听音乐,并跟随音乐轻声跟唱,若此时将该音量较小的声音误识别为通话声音,则会造成误操作,即耳机退出主动降噪模式,从而影响用户的使用感受和使用体验。因而,在本公开的实施例中,在确定目标用户后,还需进一步判断用户的音量是否高于预设的音量阈值,当确定用户的音量高于预设的音量阈值后,判定用户正在通话,此时,耳机退出主动降噪模式,从而进一步避免误识别和误操作,避免影响用户使用感受和使用体验,提高了可靠性。It is understandable that during a normal call, the user's speaking voice will reach a certain level, such as reaching a preset volume threshold, which is beneficial for the other party to receive the sound signal clearly. However, if the user's voice is low, the user is probably not talking. For example, the user is talking with the surrounding people in a low voice, or the user is currently listening to music through headphones and sings along with the music softly. If it is mistakenly recognized as a call sound, it will cause a misoperation, that is, the headset will exit the active noise reduction mode, which will affect the user's experience and experience. Therefore, in the embodiment of the present disclosure, after determining the target user, it is necessary to further determine whether the user's volume is higher than the preset volume threshold, and after determining that the user's volume is higher than the preset volume threshold, determine that the user is talking , at this time, the headset exits the active noise reduction mode, thereby further avoiding misidentification and misoperation, avoiding affecting the user's experience and experience, and improving reliability.
在具体示例中,预设的音量阈值例如为预先设置的音量阈值,其取值依据例如为尽量准确判定目标用户是否正在通话。在具体实施例中,预设的音量阈值例如为40分贝,也即在识别到目标用户通话过程中的音量大于40分贝时,判定该用户正在通话。否则,认为用户未在通话,耳机继续保持主动降噪模式。In a specific example, the preset volume threshold is, for example, a preset volume threshold, which is based on, for example, determining as accurately as possible whether the target user is on a call. In a specific embodiment, the preset volume threshold is, for example, 40 decibels, that is, when it is recognized that the volume of the target user during the call is greater than 40 decibels, it is determined that the user is talking. Otherwise, it is considered that the user is not talking, and the headset continues to maintain the active noise reduction mode.
在本公开的一个实施例中,第一控制模块110确定用户为目标用户的过程,具体包括:识别用户在通话过程中的声纹特征;将声纹特征与预设的对应于目标用户的基准声纹进行匹配;当匹配度高于预设的匹配度阈值时,确定用户为目标用户。In an embodiment of the present disclosure, the process of determining the user as the target user by the first control module 110 specifically includes: identifying the voiceprint feature of the user during the call; comparing the voiceprint feature with a preset benchmark corresponding to the target user The voiceprint is matched; when the matching degree is higher than the preset matching degree threshold, the user is determined as the target user.
具体的,例如在耳机中预先存储有目标用户,即耳机所属用户的基准声纹,进而,将通话过程中实际采集到的用户的声纹特征与预存的目标用户的基准声纹特征进行匹配,利于准确确定该用户是否为目标用户,即是否为耳机所属用户。通话过程中实际采集到的用户的声纹特征与预存的目标用户的基准声纹特征的匹配度高于预设的匹配度 阈值,则确定该用户为目标用户,即耳机所属用户,从而可准确识别用户身份,确定目标用户,防止误识别。Specifically, for example, the target user, that is, the reference voiceprint of the user to which the headset belongs is pre-stored in the headset, and then the user's voiceprint feature actually collected during the call is matched with the pre-stored target user's reference voiceprint feature, It is beneficial to accurately determine whether the user is the target user, that is, whether the user belongs to the headset. If the matching degree between the voiceprint feature of the user actually collected during the call and the reference voiceprint feature of the pre-stored target user is higher than the preset matching degree threshold, the user is determined as the target user, that is, the user to which the headset belongs, so that accurate Identify user identities, determine target users, and prevent misidentification.
在具体实施例中,预设的匹配度阈值例如为预先设定的值,其取值依据例如为尽量准确判定正在通话的用户是否为目标用户。可以理解的是,预设的匹配度阈值取值不宜过高,也不宜过低,取值太高可能由于环境扰动,不能识别出目标用户,出现弃真误差;取值太低,又可能将不是目标用户的其他用户的声音识别为目标用户,从而出现纳伪的误差。In a specific embodiment, the preset matching degree threshold is, for example, a preset value, and its value is based on, for example, determining as accurately as possible whether the user who is talking is the target user. It is understandable that the preset matching degree threshold value should not be too high or too low. If the value is too high, the target user may not be recognized due to environmental disturbances, and there will be an error of abandonment; if the value is too low, it may The voices of other users who are not the target user are recognized as the target user, resulting in a false acceptance error.
在具体实施例中,预设的匹配度阈值例如为95%。In a specific embodiment, the preset matching degree threshold is, for example, 95%.
在具体实施例中,第一控制模块110检测用户是否正在通话的过程例如包括:首先,识别通话用户为目标用户,而不是其他用户。例如,在耳机中预先记录有目标用户的声纹特征,即基准声纹,将通话过程中实际采集的用户的声纹特征与该基准声纹进行对比,若对比结果达到一定的置信度水平,如95%,则确定正在通话的该用户即为目标用户,也即耳机所属用户,从而有效避免误识别,提高目标用户的识别精度。进一步地,识别目标用户的音量是否高于预设的音量阈值,如40分贝,若是,则确定用户正在打电话,此时,耳机退出主动降噪模式。否则,认为用户未在通话,耳机继续保持主动降噪模式。In a specific embodiment, the process of the first control module 110 detecting whether the user is on a call includes, for example: first, identifying the calling user as the target user, not other users. For example, the voiceprint feature of the target user, that is, the reference voiceprint, is pre-recorded in the headset, and the user's voiceprint feature actually collected during the call is compared with the reference voiceprint. If the comparison result reaches a certain level of confidence, If it is 95%, it is determined that the user who is talking is the target user, that is, the user to which the headset belongs, thereby effectively avoiding misrecognition and improving the recognition accuracy of the target user. Further, it is identified whether the volume of the target user is higher than a preset volume threshold, such as 40 decibels, and if so, it is determined that the user is making a call, and at this time, the headset exits the active noise reduction mode. Otherwise, it is considered that the user is not talking, and the headset continues to maintain the active noise reduction mode.
在本公开的一个实施例中,第二控制模块130根据声音能量和环境噪音能量确定满足启动环境降噪条件的过程,具体包括:当用户的声音能量与环境噪声能量的比值大于预设的信噪比阈值时,确定满足启动环境降噪条件,则耳机启动环境降噪模式。否则,即用户的声音能量与环境噪声能量的比值小于或等于预设的信噪比阈值,则确定不满足启动环境降噪条件,则耳机不启动环境降噪模式。In an embodiment of the present disclosure, the second control module 130 determines the process of satisfying the conditions for enabling environmental noise reduction according to the sound energy and the ambient noise energy, and specifically includes: when the ratio of the user's sound energy to the ambient noise energy is greater than a preset signal When the noise ratio threshold is set, it is determined that the conditions for enabling ambient noise reduction are met, and the headset starts the ambient noise reduction mode. Otherwise, that is, the ratio of the user's sound energy to the ambient noise energy is less than or equal to the preset signal-to-noise ratio threshold, it is determined that the conditions for enabling ambient noise reduction are not met, and the headset does not enable the ambient noise reduction mode.
在具体实施例中,预设的信噪比阈值为预先设置的值,其取值依据例如为根据声音能量和环境噪声能量之间的关系尽量准确判定耳机是否满足开启环境降噪条件。在具体示例中,预设的信噪比阈值例如为大于1的值,例如1.5。In a specific embodiment, the preset signal-to-noise ratio threshold is a preset value, which is based on, for example, determining as accurately as possible whether the earphone satisfies the ambient noise reduction condition according to the relationship between sound energy and ambient noise energy. In a specific example, the preset signal-to-noise ratio threshold is, for example, a value greater than 1, such as 1.5.
具体而言,在得到用户的声音能量和环境噪声能量后,可根据声音能量和环境噪声能量设置一个信噪比阈值,表示为:声音能量/环境 噪声能量,即声音能量与环境噪声能量的比值。可以理解的是,环境降噪过程对有用声波信号具有自损性,所以当用户的声音能量小于或等于环境噪声能量时,一般不开启环境降噪模式,否则用户的声音会被消除。换言之,该预设的信噪比阈值预设的信噪比阈值一般设置为大于1的值,例如1.5。也即是说,当检测到实际信噪比阈值,即采集的声音能量与环境噪声能量的比值大于预设的信噪比阈值,例如1.5时,则自动启动环境降噪模式,以便降低环境噪声的干扰,从而为通话对方提供清晰、优质的语音信号,提高通话质量。Specifically, after obtaining the user's sound energy and environmental noise energy, a signal-to-noise ratio threshold can be set according to the sound energy and environmental noise energy, expressed as: sound energy/environmental noise energy, that is, the ratio of sound energy to environmental noise energy . It is understandable that the environmental noise reduction process is self-destructive to the useful sound wave signal, so when the user's sound energy is less than or equal to the environmental noise energy, the environmental noise reduction mode is generally not turned on, otherwise the user's voice will be eliminated. In other words, the preset SNR threshold is generally set to a value greater than 1, such as 1.5. That is to say, when the actual signal-to-noise ratio threshold is detected, that is, the ratio of the collected sound energy to the ambient noise energy is greater than the preset signal-to-noise ratio threshold, such as 1.5, the ambient noise reduction mode is automatically activated to reduce ambient noise. Therefore, it can provide clear and high-quality voice signals for the other party and improve the quality of the call.
从而,本公开实施例的降噪装置100,在使用耳机通话时,若检测到用户自己在讲话,则自动关闭主动降噪ANC功能,呈现出自然的背景底噪状态,以便用户有效控制音量;进一步地,能根据目前的环境噪音,自动判断是否启动环境降噪ENC功能,从而利于为通话对方能提供相对优质的语音信号,提高通话质量。Therefore, the noise reduction device 100 of the embodiment of the present disclosure, when using the headset to talk, if it is detected that the user is talking, the active noise reduction ANC function is automatically turned off, showing a natural background noise state, so that the user can effectively control the volume; Further, it can automatically determine whether to activate the environmental noise reduction ENC function according to the current environmental noise, so as to provide a relatively high-quality voice signal for the other party of the call and improve the quality of the call.
关于降噪装置100的具体限定可以参见上文中对于降噪方法的限定,在此不再赘述。上述降噪装置中的各个模块可全部或部分通过软件、硬件及其组合来实现。上述各模块可以硬件形式内嵌于或独立于计算机设备中的处理器中,也可以以软件形式存储于计算机设备中的存储器中,以便于处理器调用执行以上各个模块对应的操作。For the specific definition of the noise reduction apparatus 100, reference may be made to the definition of the noise reduction method above, which will not be repeated here. Each module in the above-mentioned noise reduction device may be implemented in whole or in part by software, hardware and combinations thereof. The above modules can be embedded in or independent of the processor in the computer device in the form of hardware, or stored in the memory in the computer device in the form of software, so that the processor can call and execute the operations corresponding to the above modules.
<实施例3><Example 3>
在一个实施例中,提供了一种音频设备,该音频设备可以是耳机,其内部结构图可以如图5所示。该音频设备包括通过系统总线连接的处理器、存储器、通信接口、显示屏和输入装置。其中,该音频设备的处理器用于提供计算和控制能力。该音频设备的存储器包括非易失性存储介质、内存储器。该非易失性存储介质存储有操作系统和计算机程序。该内存储器为非易失性存储介质中的操作系统和计算机程序的运行提供环境。该音频设备的通信接口用于与外部的终端进行有线或无线方式的通信,无线方式可通过WIFI、运营商网络、近场通信(NFC)、蓝牙或其他技术实现。该计算机程序被处理器执行时以实现一种降噪方法。该音频设备的显示屏可以是液晶显示屏或者电子墨水显示屏,该音频设备的输入装置可以是显示屏上覆盖的触摸层,也可 以是音频设备外壳上设置的按键、轨迹球或触控板,还可以是外接的键盘、触控板或鼠标等。In one embodiment, an audio device is provided, and the audio device may be an earphone, and the internal structure diagram thereof may be as shown in FIG. 5 . The audio device includes a processor, a memory, a communication interface, a display screen, and an input device connected through a system bus. Among them, the processor of the audio device is used to provide computing and control capabilities. The memory of the audio device includes a non-volatile storage medium and an internal memory. The nonvolatile storage medium stores an operating system and a computer program. The internal memory provides an environment for the execution of the operating system and computer programs in the non-volatile storage medium. The communication interface of the audio device is used for wired or wireless communication with an external terminal, and the wireless communication can be realized by WIFI, operator network, near field communication (NFC), Bluetooth or other technologies. The computer program when executed by a processor implements a noise reduction method. The display screen of the audio equipment may be a liquid crystal display screen or an electronic ink display screen, and the input device of the audio equipment may be a touch layer covered on the display screen, or a button, a trackball or a touchpad set on the casing of the audio equipment , or an external keyboard, trackpad, or mouse.
本领域技术人员可以理解,图5中示出的结构,仅仅是与本公开方案相关的部分结构的框图,并不构成对本公开方案所应用于其上的音频设备的限定,具体的音频设备可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。Those skilled in the art can understand that the structure shown in FIG. 5 is only a block diagram of a partial structure related to the solution of the present disclosure, and does not constitute a limitation on the audio device to which the solution of the present disclosure is applied. The specific audio device may be Include more or fewer components than shown in the figures, or combine certain components, or have a different arrangement of components.
在一个实施例中,本公开提供的降噪装置可以实现为一种计算机程序的形式,计算机程序可在如图5所示的音频设备上运行。音频设备的存储器中可存储组成该降噪装置的各个程序模块,比如,图4所示的第一控制模块110、获取模块120和第二控制模块130。各个程序模块构成的计算机程序使得处理器执行本说明书中描述的本公开各个实施例的降噪方法中的步骤。In one embodiment, the noise reduction apparatus provided by the present disclosure may be implemented in the form of a computer program, and the computer program may run on the audio device as shown in FIG. 5 . The memory of the audio device may store various program modules that constitute the noise reduction apparatus, for example, the first control module 110, the acquisition module 120, and the second control module 130 shown in FIG. 4 . The computer program constituted by each program module causes the processor to execute the steps in the noise reduction method of each embodiment of the present disclosure described in this specification.
例如,图5所示的音频设备可以通过如图4所示的降噪装置中的第一控制模块110执行前述步骤S1。音频设备可通过获取模块120执行前述步骤S2。音频设备可通过第二控制模块130执行前述步骤S3。For example, the audio device shown in FIG. 5 may execute the aforementioned step S1 through the first control module 110 in the noise reduction apparatus shown in FIG. 4 . The audio device may perform the aforementioned step S2 through the obtaining module 120 . The audio device may perform the aforementioned step S3 through the second control module 130 .
在一个实施例中,提供了一种音频设备,包括存储器和处理器,该存储器存储有计算机程序,该处理器执行计算机程序时实现以下步骤:In one embodiment, an audio device is provided, comprising a memory and a processor, the memory stores a computer program, and the processor implements the following steps when executing the computer program:
步骤S1:耳机处于主动降噪模式时,若检测到用户正在通话,则控制耳机退出主动降噪模式。Step S1: When the headset is in the active noise reduction mode, if it is detected that the user is talking, the headset is controlled to exit the active noise reduction mode.
步骤S2:获取通话过程中用户的声音能量和环境噪音能量。Step S2: Acquire the sound energy and ambient noise energy of the user during the call.
步骤S3:若根据声音能量和环境噪音能量确定满足启动环境降噪条件,则控制耳机启动环境降噪模式。Step S3: If it is determined according to the sound energy and the environmental noise energy that the conditions for enabling environmental noise reduction are met, control the earphone to start the environmental noise reduction mode.
在一个实施例中,处理器执行计算机程序时还实现以下步骤:In one embodiment, the processor further implements the following steps when executing the computer program:
步骤S11:确定用户为目标用户。Step S11: Determine the user as the target user.
步骤S12:目标用户在通话过程中的音量高于预设的音量阈值,则判定检测到用户正在通话。Step S12: If the volume of the target user during the call is higher than the preset volume threshold, it is determined that the user is on a call.
在一个实施例中,处理器执行计算机程序时还实现以下步骤:In one embodiment, the processor further implements the following steps when executing the computer program:
步骤S111:识别用户在通话过程中的声纹特征。Step S111: Identify the voiceprint feature of the user during the call.
步骤S112:将声纹特征与预设的对应于目标用户的基准声纹进行 匹配。Step S112: Match the voiceprint feature with a preset reference voiceprint corresponding to the target user.
步骤S113:若匹配度高于预设的匹配度阈值,则确定用户为目标用户。Step S113: If the matching degree is higher than the preset matching degree threshold, determine the user as the target user.
在一个实施例中,处理器执行计算机程序时还实现以下步骤:若用户的声音能量与环境噪声能量的比值大于预设的信噪比阈值,则确定满足启动环境降噪条件。In one embodiment, when the processor executes the computer program, the processor further implements the following steps: if the ratio of the user's sound energy to the ambient noise energy is greater than a preset signal-to-noise ratio threshold, it is determined that a condition for enabling ambient noise reduction is satisfied.
本公开实施例的音频设备,检测到用户正在使用耳机通话时,控制耳机退出主动降噪模式,即在通话时自动关闭主动降噪功能,从而为用户提供背景底噪作为参考,利于用户更好的控制音量大小,避免由于声音过大而引起周围人或通话对方的不适感。另外,根据获取的通话过程中用户的声音能量和环境噪音能量确定满足启动环境降噪条件时,控制耳机启动环境降噪功能,即实现自适应环境降噪,从而可避免环境噪声对通话声音的干扰,利于向通话对方提高优质的语音信号,从而提高了通话质量。The audio device of the embodiment of the present disclosure controls the headset to exit the active noise reduction mode when it is detected that the user is using the headset to talk, that is, the active noise reduction function is automatically turned off during the call, so as to provide the user with background noise as a reference, which is beneficial to the user. It can control the volume level to avoid the discomfort caused by the surrounding people or the other party due to the loud sound. In addition, according to the acquired sound energy and ambient noise energy of the user during the call, when it is determined that the conditions for enabling ambient noise reduction are met, the headset is controlled to enable the ambient noise reduction function, that is, adaptive ambient noise reduction is realized, thereby avoiding the impact of ambient noise on the call sound. Interference, which is beneficial to improve the high-quality voice signal to the other party of the call, thereby improving the quality of the call.
在一个实施例中,提供了一种计算机可读存储介质,其上存储有计算机程序,计算机程序被处理器执行时实现以下步骤:In one embodiment, a computer-readable storage medium is provided on which a computer program is stored, and when the computer program is executed by a processor, the following steps are implemented:
步骤S1:耳机处于主动降噪模式时,若检测到用户正在通话,则控制耳机退出主动降噪模式。Step S1: When the headset is in the active noise reduction mode, if it is detected that the user is talking, the headset is controlled to exit the active noise reduction mode.
步骤S2:获取通话过程中用户的声音能量和环境噪音能量。Step S2: Acquire the sound energy and ambient noise energy of the user during the call.
步骤S3:若根据声音能量和环境噪音能量确定满足启动环境降噪条件,则控制耳机启动环境降噪模式。Step S3: If it is determined according to the sound energy and the environmental noise energy that the conditions for enabling environmental noise reduction are met, control the earphone to start the environmental noise reduction mode.
在一个实施例中,计算机程序被处理器执行时还实现以下步骤:In one embodiment, the computer program further implements the following steps when executed by the processor:
步骤S11:确定用户为目标用户。Step S11: Determine the user as the target user.
步骤S12:目标用户在通话过程中的音量高于预设的音量阈值,则判定检测到用户正在通话。Step S12: If the volume of the target user during the call is higher than the preset volume threshold, it is determined that the user is on a call.
在一个实施例中,计算机程序被处理器执行时还实现以下步骤:In one embodiment, the computer program further implements the following steps when executed by the processor:
步骤S111:识别用户在通话过程中的声纹特征。Step S111: Identify the voiceprint feature of the user during the call.
步骤S112:将声纹特征与预设的对应于目标用户的基准声纹进行匹配。Step S112: Match the voiceprint feature with a preset reference voiceprint corresponding to the target user.
步骤S113:若匹配度高于预设的匹配度阈值,则确定用户为目标 用户。Step S113: If the matching degree is higher than the preset matching degree threshold, determine the user as the target user.
在一个实施例中,计算机程序被处理器执行时还实现以下步骤:若用户的声音能量与环境噪声能量的比值大于预设的信噪比阈值,则确定满足启动环境降噪条件。In one embodiment, when the computer program is executed by the processor, the following steps are further implemented: if the ratio of the user's sound energy to the ambient noise energy is greater than a preset signal-to-noise ratio threshold, it is determined that a condition for enabling ambient noise reduction is satisfied.
本公开实施例的计算机可读存储介质,检测到用户正在使用耳机通话时,控制耳机退出主动降噪模式,即在通话时自动关闭主动降噪功能,从而为用户提供背景底噪作为参考,利于用户更好的控制音量大小,避免由于声音过大而引起周围人或通话对方的不适感。另外,根据获取的通话过程中用户的声音能量和环境噪音能量确定满足启动环境降噪条件时,控制耳机启动环境降噪功能,即实现自适应环境降噪,从而可避免环境噪声对通话声音的干扰,利于向通话对方提高优质的语音信号,从而提高了通话质量。The computer-readable storage medium of the embodiment of the present disclosure controls the headset to exit the active noise reduction mode when it is detected that the user is using the headset to talk, that is, the active noise reduction function is automatically turned off during the call, so as to provide the user with background noise as a reference, which is beneficial to The user can better control the volume and avoid the discomfort caused by the loudness of the surrounding people or the other party in the call. In addition, according to the acquired sound energy and ambient noise energy of the user during the call, when it is determined that the conditions for enabling ambient noise reduction are met, the headset is controlled to enable the ambient noise reduction function, that is, adaptive ambient noise reduction is realized, thereby avoiding the impact of ambient noise on the call sound. Interference, which is beneficial to improve the high-quality voice signal to the other party of the call, thereby improving the quality of the call.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机程序来指令相关的硬件来完成,所述的计算机程序可存储于一非易失性计算机可读取存储介质中,该计算机程序在执行时,可包括如上述各方法的实施例的流程。其中,本公开所提供的各实施例中所使用的对存储器、数据库或其它介质的任何引用,均可包括非易失性和易失性存储器中的至少一种。非易失性存储器可包括只读存储器(Read-Only Memory,ROM)、磁带、软盘、闪存或光存储器等。易失性存储器可包括随机存取存储器(Random Access Memory,RAM)或者外部高速缓冲存储器。作为说明而非局限,RAM以多种形式可得,比如静态随机存取存储器(Static Random Access Memory,SRAM)和动态随机存取存储器(Dynamic Random Access Memory,DRAM)等。Those of ordinary skill in the art can understand that all or part of the processes in the methods of the above embodiments can be implemented by instructing relevant hardware through a computer program, and the computer program can be stored in a non-volatile computer-readable storage In the medium, when the computer program is executed, it may include the processes of the above-mentioned method embodiments. Wherein, any reference to memory, database or other media used in the various embodiments provided by the present disclosure may include at least one of non-volatile and volatile memory. Non-volatile memory may include read-only memory (Read-Only Memory, ROM), magnetic tape, floppy disk, flash memory, or optical memory, and the like. Volatile memory may include random access memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in various forms, such as Static Random Access Memory (SRAM) and Dynamic Random Access Memory (DRAM).
以上实施例的各技术特征可以进行任意的组合,为使描述简洁,未对上述实施例中的各个技术特征所有可能的组合都进行描述,然而,只要这些技术特征的组合不存在矛盾,都应当认为是本说明书记载的范围。The technical features of the above embodiments can be combined arbitrarily. For the sake of brevity, all possible combinations of the technical features in the above embodiments are not described. However, as long as there is no contradiction in the combination of these technical features, all It is considered to be the range described in this specification.
以上所述实施例仅表达了本公开的几种实施方式,其描述较为具体和详细,但并不能因此而理解为对发明专利范围的限制。应当指出 的是,对于本领域的普通技术人员来说,在不脱离本公开构思的前提下,还可以做出若干变形和改进,这些都属于本公开的保护范围。因此,本公开专利的保护范围应以所附权利要求为准。The above-mentioned embodiments only represent several embodiments of the present disclosure, and the descriptions thereof are specific and detailed, but should not be construed as limiting the scope of the invention patent. It should be pointed out that for those of ordinary skill in the art, without departing from the concept of the present disclosure, several modifications and improvements can also be made, which all belong to the protection scope of the present disclosure. Accordingly, the scope of protection of the present disclosure should be determined by the appended claims.
本公开提供的降噪方法,使得在用户使用耳机进行通话时,通过控制耳机退出主动降噪模式,启动环境降噪功能来避免环境噪声对通话声音的干扰,有利于向通话对方提供优质的语音信号,提高了通话质量,具有很强的工业实用性。The noise reduction method provided by the present disclosure enables the user to control the headset to exit the active noise reduction mode and activate the environmental noise reduction function to avoid the interference of the environmental noise on the call sound when the user uses the headset to make a call, which is beneficial to provide high-quality voice to the other party of the call. signal, improve the call quality, and has strong industrial practicability.
Claims (16)
- 一种降噪方法,应用于耳机,其特征在于,所述方法包括:A noise reduction method, applied to earphones, characterized in that the method comprises:所述耳机处于主动降噪模式时,若检测到用户正在通话,则控制所述耳机退出所述主动降噪模式;When the headset is in the active noise reduction mode, if it is detected that the user is talking, controlling the headset to exit the active noise reduction mode;获取通话过程中所述用户的声音能量和环境噪音能量;Obtain the voice energy and ambient noise energy of the user during the call;若根据所述声音能量和环境噪音能量确定满足启动环境降噪条件,则控制所述耳机启动环境降噪模式。If it is determined according to the sound energy and the ambient noise energy that the conditions for enabling ambient noise reduction are met, the earphone is controlled to enable the ambient noise reduction mode.
- 根据权利要求1所述的方法,其特征在于,所述检测到用户正在通话的过程,包括:The method according to claim 1, wherein the process of detecting that the user is talking includes:确定所述用户为目标用户;determining that the user is a target user;所述目标用户在通话过程中的音量高于预设的音量阈值,则判定检测到所述用户正在通话。If the volume of the target user during the call is higher than a preset volume threshold, it is determined that the user is on a call.
- 根据权利要求2所述的方法,其特征在于,所述确定所述用户为目标用户的过程,包括:The method according to claim 2, wherein the process of determining the user as a target user comprises:识别所述用户在通话过程中的声纹特征;Identify the voiceprint feature of the user during the call;将所述声纹特征与预设的对应于所述目标用户的基准声纹进行匹配;matching the voiceprint feature with a preset reference voiceprint corresponding to the target user;若匹配度高于预设的匹配度阈值,则确定所述用户为所述目标用户。If the matching degree is higher than a preset matching degree threshold, it is determined that the user is the target user.
- 根据权利要求2所述的方法,其特征在于,所述方法还包括:The method according to claim 2, wherein the method further comprises:检测所述目标用户在通话过程中的音量;detecting the volume of the target user during the call;若所述目标用户在通话过程中的音量小于或等于预设的音量阈值,则判定所述目标用户未进行通话。If the volume of the target user during the call is less than or equal to a preset volume threshold, it is determined that the target user is not talking.
- 根据权利要求1所述的方法,其特征在于,所述控制所述耳机启动环境降噪模式,包括:The method according to claim 1, wherein the controlling the headset to activate an ambient noise reduction mode comprises:通过双麦克风阵列计算所述用户说话的方位;Calculate the orientation of the user's speech through a dual-microphone array;基于所述用户说话的方位确定目标语音和环境噪音;determining target speech and ambient noise based on the location where the user speaks;对所述环境噪音进行去除。The ambient noise is removed.
- 根据权利要求1所述的方法,其特征在于,所述方法还包括:The method according to claim 1, wherein the method further comprises:确定所述声音能量和环境噪音能量;determining the sound energy and ambient noise energy;若所述声音能量和环境噪音能量不满足所述启动环境降噪条件,则不启动环境降噪模式。If the sound energy and ambient noise energy do not meet the activation environment noise reduction condition, the ambient noise reduction mode is not activated.
- 根据权利要求1-6任一项所述的方法,其特征在于,所述根据所述声音能量和环境噪音能量确定满足启动环境降噪条件的过程,包括:The method according to any one of claims 1-6, characterized in that, the process of determining, according to the sound energy and ambient noise energy, that a condition for enabling ambient noise reduction is met, comprises:若所述用户的声音能量与所述环境噪声能量的比值大于预设的信噪比阈值,则确定满足所述启动环境降噪条件。If the ratio of the user's sound energy to the environmental noise energy is greater than a preset signal-to-noise ratio threshold, it is determined that the activation environment noise reduction condition is satisfied.
- 一种降噪装置,应用于耳机,其特征在于,所述装置包括:A noise reduction device, applied to earphones, characterized in that the device comprises:第一控制模块,用于在所述耳机处于主动降噪模式下,当检测到用户正在通话时,控制所述耳机退出所述主动降噪模式;a first control module, configured to control the headset to exit the active noise reduction mode when it is detected that the user is talking when the headset is in the active noise reduction mode;获取模块,用于获取通话过程中所述用户的声音能量和环境噪音能量;an acquisition module for acquiring the user's voice energy and ambient noise energy during the call;第二控制模块,用于根据所述声音能量和环境噪音能量确定满足启动环境降噪条件时,控制所述耳机启动环境降噪模式。The second control module is configured to control the earphone to activate the ambient noise reduction mode when it is determined according to the sound energy and the ambient noise energy that the conditions for enabling ambient noise reduction are satisfied.
- 根据权利要求8所述的装置,其特征在于,所述第一控制模块,具体用于:确定所述用户为目标用户,且所述目标用户在通话过程中的音量高于预设的音量阈值时,判定检测到所述用户正在通话。The device according to claim 8, wherein the first control module is specifically configured to: determine that the user is a target user, and the volume of the target user during the call is higher than a preset volume threshold When , it is determined that the user is on a call.
- 根据权利要求9所述的装置,其特征在于,所述第一控制模块,具体用于:The device according to claim 9, wherein the first control module is specifically configured to:识别所述用户在通话过程中的声纹特征;Identify the voiceprint feature of the user during the call;将所述声纹特征与预设的对应于所述目标用户的基准声纹进行匹配;matching the voiceprint feature with a preset reference voiceprint corresponding to the target user;当匹配度高于预设的匹配度阈值时,确定所述用户为所述目标用户。When the matching degree is higher than a preset matching degree threshold, it is determined that the user is the target user.
- 根据权利要求9所述的装置,其特征在于,所述第二控制模块,还用于:The device according to claim 9, wherein the second control module is further configured to:检测所述目标用户在通话过程中的音量;detecting the volume of the target user during the call;若所述目标用户在通话过程中的音量小于或等于预设的音量阈值,则判定所述目标用户未进行通话。If the volume of the target user during the call is less than or equal to a preset volume threshold, it is determined that the target user is not talking.
- 根据权利要求8所述的装置,其特征在于,所述第二控制模块,具体用于:The device according to claim 8, wherein the second control module is specifically used for:通过双麦克风阵列计算所述用户说话的方位;Calculate the orientation of the user's speech through a dual-microphone array;基于所述用户说话的方位确定目标语音和环境噪音;determining target speech and ambient noise based on the location where the user speaks;对所述环境噪音进行去除。The ambient noise is removed.
- 根据权利要求8所述的装置,其特征在于,所述第二控制模块,还用于:The device according to claim 8, wherein the second control module is further configured to:确定所述声音能量和环境噪音能量;determining the sound energy and ambient noise energy;若所述声音能量和环境噪音能量不满足所述启动环境降噪条件,则不启动环境降噪模式。If the sound energy and ambient noise energy do not satisfy the activation environment noise reduction condition, the ambient noise reduction mode is not activated.
- 根据权利要求8-13任一项所述的装置,其特征在于,所述第二控制模块,具体用于:当所述用户的声音能量与所述环境噪声能量的比值大于预设的信噪比阈值时,确定满足所述启动环境降噪条件。The device according to any one of claims 8-13, wherein the second control module is specifically configured to: when the ratio of the user's sound energy to the ambient noise energy is greater than a preset signal-to-noise ratio When the ratio is greater than the threshold, it is determined that the startup environment noise reduction condition is satisfied.
- 一种音频设备,包括存储器和处理器,所述存储器存储有计算机程序,其特征在于,所述处理器执行所述计算机程序时实现权利要求1至7中任一项所述降噪方法的步骤。An audio device, comprising a memory and a processor, wherein the memory stores a computer program, wherein the processor implements the steps of the noise reduction method according to any one of claims 1 to 7 when the processor executes the computer program .
- 一种计算机可读存储介质,其上存储有计算机程序,其特征在于,所述计算机程序被处理器执行时实现权利要求1至7中任一项所述的降噪方法的步骤。A computer-readable storage medium on which a computer program is stored, characterized in that, when the computer program is executed by a processor, the steps of the noise reduction method according to any one of claims 1 to 7 are implemented.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110065090.2 | 2021-01-18 | ||
CN202110065090.2A CN112911441A (en) | 2021-01-18 | 2021-01-18 | Noise reduction method, apparatus, audio device, and computer-readable storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2022151657A1 true WO2022151657A1 (en) | 2022-07-21 |
Family
ID=76115930
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2021/101463 WO2022151657A1 (en) | 2021-01-18 | 2021-06-22 | Noise cancellation method and apparatus, and audio device and computer-readable storage medium |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN112911441A (en) |
WO (1) | WO2022151657A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117641178A (en) * | 2023-11-17 | 2024-03-01 | 广州伟仕达电子科技有限公司 | Active noise reduction Bluetooth headset and assembly thereof |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112911441A (en) * | 2021-01-18 | 2021-06-04 | 上海闻泰信息技术有限公司 | Noise reduction method, apparatus, audio device, and computer-readable storage medium |
CN113470661A (en) * | 2021-06-17 | 2021-10-01 | 深圳市视晶无线技术有限公司 | Audio talkback starting request method for realizing automatic PTT, audio talkback method and storage medium |
CN113596670B (en) * | 2021-08-30 | 2022-10-14 | 歌尔科技有限公司 | Earphone, earphone noise reduction mode switching method and device, and storage medium |
CN113891207A (en) * | 2021-09-24 | 2022-01-04 | 深圳艾创力科技有限公司 | ANC noise reduction method and system for neck-wearing voice interaction earphone |
CN113891109B (en) * | 2021-12-08 | 2022-03-15 | 深圳市北科瑞声科技股份有限公司 | Adaptive noise reduction method, device, equipment and storage medium |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107910013A (en) * | 2017-11-10 | 2018-04-13 | 广东欧珀移动通信有限公司 | The output processing method and device of a kind of voice signal |
US10129633B1 (en) * | 2017-10-13 | 2018-11-13 | Bose Corporation | Automated awareness for ANR systems |
CN109195044A (en) * | 2018-08-08 | 2019-01-11 | 歌尔股份有限公司 | Noise cancelling headphone, call terminal and method for noise reduction control and the way of recording |
CN112911441A (en) * | 2021-01-18 | 2021-06-04 | 上海闻泰信息技术有限公司 | Noise reduction method, apparatus, audio device, and computer-readable storage medium |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105959842A (en) * | 2016-04-29 | 2016-09-21 | 歌尔股份有限公司 | Earphone noise reduction processing method and device, and earphone |
CN108551604B (en) * | 2018-04-26 | 2021-05-07 | 海尔优家智能科技(北京)有限公司 | Noise reduction method, noise reduction device and noise reduction earphone |
CN109688513A (en) * | 2018-11-19 | 2019-04-26 | 恒玄科技(上海)有限公司 | Wireless active noise reduction earphone and double active noise reduction earphone communicating data processing methods |
CN111836147B (en) * | 2019-04-16 | 2022-04-12 | 华为技术有限公司 | Noise reduction device and method |
CN111464905A (en) * | 2020-04-09 | 2020-07-28 | 电子科技大学 | Hearing enhancement method and system based on intelligent wearable device and wearable device |
-
2021
- 2021-01-18 CN CN202110065090.2A patent/CN112911441A/en active Pending
- 2021-06-22 WO PCT/CN2021/101463 patent/WO2022151657A1/en active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US10129633B1 (en) * | 2017-10-13 | 2018-11-13 | Bose Corporation | Automated awareness for ANR systems |
CN107910013A (en) * | 2017-11-10 | 2018-04-13 | 广东欧珀移动通信有限公司 | The output processing method and device of a kind of voice signal |
CN109195044A (en) * | 2018-08-08 | 2019-01-11 | 歌尔股份有限公司 | Noise cancelling headphone, call terminal and method for noise reduction control and the way of recording |
CN112911441A (en) * | 2021-01-18 | 2021-06-04 | 上海闻泰信息技术有限公司 | Noise reduction method, apparatus, audio device, and computer-readable storage medium |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117641178A (en) * | 2023-11-17 | 2024-03-01 | 广州伟仕达电子科技有限公司 | Active noise reduction Bluetooth headset and assembly thereof |
Also Published As
Publication number | Publication date |
---|---|
CN112911441A (en) | 2021-06-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2022151657A1 (en) | Noise cancellation method and apparatus, and audio device and computer-readable storage medium | |
US11483434B2 (en) | Method and apparatus for adjusting volume of user terminal, and terminal | |
US11748462B2 (en) | Biometric authentication | |
US11270696B2 (en) | Audio device with wakeup word detection | |
CN110268470B (en) | Audio device filter modification | |
US11257512B2 (en) | Adaptive spatial VAD and time-frequency mask estimation for highly non-stationary noise sources | |
US11437021B2 (en) | Processing audio signals | |
US20140081644A1 (en) | Method and Device for Voice Operated Control | |
JP6959917B2 (en) | Event detection for playback management in audio equipment | |
JP2018517167A (en) | Method and apparatus for processing audio signals | |
CN107564523B (en) | Earphone answering method and device and earphone | |
EP4394761A1 (en) | Audio signal processing method and apparatus, electronic device, and storage medium | |
US11290802B1 (en) | Voice detection using hearable devices | |
WO2008011319A2 (en) | Method and system for near-end detection | |
WO2018118744A1 (en) | Methods and systems for reducing false alarms in keyword detection | |
US9373338B1 (en) | Acoustic echo cancellation processing based on feedback from speech recognizer | |
CN108810778B (en) | Method for operating a hearing device and hearing device | |
US11455980B2 (en) | Vehicle and controlling method of vehicle | |
CN115623121A (en) | Call method and electronic equipment | |
KR20200010149A (en) | Apparatus for recognizing call sign and method for the same | |
CN116320872A (en) | Earphone mode switching method and device, electronic equipment and storage medium | |
CN116320867A (en) | Wind noise detection method and device and earphone | |
CN116887126A (en) | Noise cancellation method, device, equipment and computer readable storage medium | |
KR20140006195A (en) | Echo canceler device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21918853 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 21918853 Country of ref document: EP Kind code of ref document: A1 |