[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN115148203B - Pickup calibration method and device, nonvolatile storage medium and electronic equipment - Google Patents

Pickup calibration method and device, nonvolatile storage medium and electronic equipment Download PDF

Info

Publication number
CN115148203B
CN115148203B CN202210631086.2A CN202210631086A CN115148203B CN 115148203 B CN115148203 B CN 115148203B CN 202210631086 A CN202210631086 A CN 202210631086A CN 115148203 B CN115148203 B CN 115148203B
Authority
CN
China
Prior art keywords
energy value
audio
microphone
calibration coefficient
calibration
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210631086.2A
Other languages
Chinese (zh)
Other versions
CN115148203A (en
Inventor
王祖悦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qingdao Haier Technology Co Ltd
Haier Smart Home Co Ltd
Original Assignee
Qingdao Haier Technology Co Ltd
Haier Smart Home Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qingdao Haier Technology Co Ltd, Haier Smart Home Co Ltd filed Critical Qingdao Haier Technology Co Ltd
Priority to CN202210631086.2A priority Critical patent/CN115148203B/en
Publication of CN115148203A publication Critical patent/CN115148203A/en
Application granted granted Critical
Publication of CN115148203B publication Critical patent/CN115148203B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/02Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/22Interactive procedures; Man-machine interfaces
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/21Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/60Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for measuring the quality of voice signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/28Data switching networks characterised by path configuration, e.g. LAN [Local Area Networks] or WAN [Wide Area Networks]
    • H04L12/2803Home automation networks
    • H04L12/2816Controlling appliance services of a home automation network by calling their functionalities
    • H04L12/282Controlling appliance services of a home automation network by calling their functionalities based on user interaction within the home
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Automation & Control Theory (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

The application discloses a pickup calibration method, a device, a nonvolatile storage medium and electronic equipment, and relates to the field of intelligent home, wherein the pickup calibration method comprises the following steps: acquiring first audio and second audio, wherein the first audio is obtained by a first microphone of a first device collecting wake-up voice sent by a standard device, and the second audio is obtained by a second microphone of a second device collecting wake-up voice sent by the standard device; determining a first energy value corresponding to the first audio and a second energy value corresponding to the second audio; determining a first calibration coefficient corresponding to the first device and a second calibration coefficient corresponding to the second device based on the first energy value and the second energy value; and respectively carrying out pickup calibration on the first microphone and the second microphone according to the first calibration coefficient and the second calibration coefficient.

Description

Pickup calibration method and device, nonvolatile storage medium and electronic equipment
Technical Field
The application relates to the field of intelligent home, in particular to a pickup calibration method and device, a nonvolatile storage medium and electronic equipment.
Background
In the intelligent home scene, a plurality of intelligent devices exist in the home of the user, and most of the intelligent devices support voice interaction with the user nowadays so as to realize intelligent control of the intelligent devices by the user and reduce the workload of the user. However, when the user sends out a voice command to the intelligent device that the user wants to control, other intelligent devices also receive the sent voice command of the user, so that the plurality of intelligent devices need to share information to judge which intelligent device the voice command currently sent by the user is matched with, and inconvenience is brought to the life of the user due to the fact that the wrong device responds to the user command is avoided. However, the related art has the following problems: by adopting a mode of calling the device name and adding the user instruction, the user needs to memorize the accurate names of various devices, so that the voice control behavior of the user is very complicated, and the user experience is poor; or the energy of the voice command received by different intelligent devices is used as the judgment quantity, but the method is limited by different pickup capacities of the different intelligent devices, and accurate judgment is difficult to carry out.
In view of the above problems, no effective solution has been proposed at present.
Disclosure of Invention
The embodiment of the invention provides a pickup calibration method, a pickup calibration device, a computer-readable storage medium and electronic equipment, which are used for at least solving the technical problem that a convenient equipment pickup calibration method is lacked in an intelligent home scene.
According to an aspect of an embodiment of the present invention, there is provided a pickup calibration method including: acquiring first audio and second audio, wherein the first audio is obtained by collecting wake-up voice sent by standard equipment by a first microphone of first equipment, and the second audio is obtained by collecting the wake-up voice sent by the standard equipment by a second microphone of second equipment; determining a first energy value corresponding to the first audio and a second energy value corresponding to the second audio; determining a first calibration coefficient corresponding to the first device and a second calibration coefficient corresponding to the second device based on the first energy value and the second energy value; and respectively carrying out pickup calibration on the first microphone and the second microphone according to the first calibration coefficient and the second calibration coefficient.
Optionally, the determining, based on the first energy value and the second energy value, a first calibration coefficient corresponding to the first device and a second calibration coefficient corresponding to the second device includes: acquiring third audio and fourth audio, wherein the third audio is audio obtained by collecting the wake-up voice sent by the first device through a standard microphone of the standard device, and the fourth audio is audio obtained by collecting the wake-up voice sent by the second device through the standard microphone of the standard device; determining a third energy value corresponding to the third audio and a fourth energy value corresponding to the fourth audio; the first calibration coefficient is determined based on the first energy value and the third energy value, and the second calibration coefficient is determined based on the second energy value and the fourth energy value.
Optionally, the determining the first calibration coefficient based on the first energy value and the third energy value, and the determining the second calibration coefficient based on the second energy value and the fourth energy value, includes: determining a ratio of the third energy value and the first energy value as the first calibration coefficient; and determining a ratio of the fourth energy value and the second energy value as the second calibration coefficient.
Optionally, the method further comprises: acquiring an online equipment list, wherein the online equipment list comprises a plurality of equipment; selecting one calibrated device in the online device list as the standard device, and selecting the first device and the second device in the online device list as devices to be calibrated; and sending wake-up word data to the standard equipment, wherein the wake-up word data is used for generating the wake-up voice by the standard equipment.
Optionally, the method further comprises: and sending the wake-up word data to the first device and the second device, wherein the first device and the second device respond to wake-up voice sent by the standard device based on the wake-up word data, and call the first microphone and the second microphone to collect the wake-up voice sent by the standard device to obtain the first audio and the second audio respectively.
Optionally, the performing pickup calibration on the first microphone and the second microphone according to the first calibration coefficient and the second calibration coefficient respectively includes: transmitting the first calibration coefficient to the first device, wherein the first device calibrates the energy value of the received audio according to the first calibration coefficient; and transmitting the second calibration coefficient to the second device, wherein the second device calibrates the energy value of the received audio according to the second calibration coefficient.
Optionally, the method further comprises: acquiring fifth audio, wherein the fifth audio is audio obtained by acquiring wake-up voice sent by the standard equipment by a third microphone of the third equipment; determining a fifth energy value corresponding to the fifth audio; determining a third correction coefficient corresponding to the third device based on the fifth energy value and any one of the first energy value and the second energy value; and carrying out pickup calibration on the third microphone according to the third correction coefficient.
According to another aspect of the embodiment of the present invention, there is also provided a pickup calibration apparatus including: the system comprises an acquisition module, a first audio acquisition module and a second audio acquisition module, wherein the first audio is obtained by a first microphone of a first device acquiring wake-up voice sent by a standard device, and the second audio is obtained by a second microphone of a second device acquiring the wake-up voice sent by the standard device; the first determining module is used for determining a first energy value corresponding to the first audio and a second energy value corresponding to the second audio; a second determining module, configured to determine a first calibration coefficient corresponding to the first device and a second calibration coefficient corresponding to the second device based on the first energy value and the second energy value; and the calibration module is used for respectively carrying out pickup calibration on the first microphone and the second microphone according to the first calibration coefficient and the second calibration coefficient.
According to still another aspect of the embodiments of the present invention, there is also provided a computer-readable storage medium including a stored program, wherein the program, when executed, controls a device in which the computer-readable storage medium is located to execute any one of the pickup calibration methods described above.
According to still another aspect of the embodiment of the present invention, there is also provided an electronic apparatus including a processor for executing a program, wherein the program executes any one of the pickup calibration methods described above.
In the embodiment of the invention, the first audio and the second audio acquired by the microphones of the first equipment and the second equipment are acquired, the first energy value and the second energy value are determined based on the first audio and the second audio, and the first calibration coefficient and the second calibration coefficient for carrying out pickup calibration on the first microphone of the first equipment and the second microphone of the second equipment are determined based on the first energy value and the second energy value, so that the purpose of calibrating the pickup capacities of the first equipment and the second equipment in the smart home scene is achieved, the technical effect of providing a convenient device pickup calibration method in the smart home scene is achieved, and the technical problem that a convenient device pickup calibration method is lacked in the smart home scene is solved.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the application and together with the description, serve to explain the principles of the application.
In order to more clearly illustrate the embodiments of the application or the technical solutions of the prior art, the drawings which are used in the description of the embodiments or the prior art will be briefly described, and it will be obvious to a person skilled in the art that other drawings can be obtained from these drawings without inventive effort.
FIG. 1 is a schematic diagram of a hardware environment of an interaction method of a smart device according to an embodiment of the present application;
fig. 2 is a flowchart of a pickup calibration method according to an embodiment of the present invention;
Fig. 3 is a block diagram of a sound pickup calibration apparatus according to an embodiment of the present invention.
Detailed Description
In order that those skilled in the art will better understand the present application, a technical solution in the embodiments of the present application will be clearly and completely described below with reference to the accompanying drawings in which it is apparent that the described embodiments are only some embodiments of the present application, not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the present application without making any inventive effort, shall fall within the scope of the present application.
It should be noted that the terms "first," "second," and the like in the description and the claims of the present application and the above figures are used for distinguishing between similar objects and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used may be interchanged where appropriate such that the embodiments of the application described herein may be implemented in sequences other than those illustrated or otherwise described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
According to one aspect of the embodiment of the application, an interaction method of intelligent household equipment is provided. The interaction method of the intelligent household equipment is widely applied to full-house intelligent digital control application scenes such as intelligent Home (Smart Home), intelligent Home, intelligent household equipment ecology, intelligent Home (INTELLIGENCE HOUSE) ecology and the like. Alternatively, in this embodiment, the above-mentioned interaction method of the smart home device may be applied to a hardware environment formed by the terminal device 102 and the server 104 as shown in fig. 1. As shown in fig. 1, the server 104 is connected to the terminal device 102 through a network, and may be used to provide services (such as application services and the like) for a terminal or a client installed on the terminal, a database may be set on the server or independent of the server, for providing data storage services for the server 104, and cloud computing and/or edge computing services may be configured on the server or independent of the server, for providing data computing services for the server 104.
The network may include, but is not limited to, at least one of: wired network, wireless network. The wired network may include, but is not limited to, at least one of: a wide area network, a metropolitan area network, a local area network, and the wireless network may include, but is not limited to, at least one of: WIFI (WIRELESS FIDELITY ), bluetooth. The terminal device 102 may not be limited to a PC, a mobile phone, a tablet computer, an intelligent air conditioner, an intelligent smoke machine, an intelligent refrigerator, an intelligent oven, an intelligent cooking range, an intelligent washing machine, an intelligent water heater, an intelligent washing device, an intelligent dish washer, an intelligent projection device, an intelligent television, an intelligent clothes hanger, an intelligent curtain, an intelligent video, an intelligent socket, an intelligent sound box, an intelligent fresh air device, an intelligent kitchen and toilet device, an intelligent bathroom device, an intelligent sweeping robot, an intelligent window cleaning robot, an intelligent mopping robot, an intelligent air purifying device, an intelligent steam box, an intelligent microwave oven, an intelligent kitchen appliance, an intelligent purifier, an intelligent water dispenser, an intelligent door lock, and the like.
The application provides an embodiment of a method for accurately judging target equipment which is expected to be controlled by a user sending a voice instruction by carrying out pickup calibration on the equipment. In the related art, when performing pickup calibration on different devices, calibration of pickup microphones of the respective devices is required, that is: all microphones were calibrated to the same standard microphone. This is also the current mainstream calibration scheme. For example, an offline real-time manual calibration is employed, namely: the sound producing position and the sound receiving position are fixed, a standard loudspeaker is adopted to play audio, equipment to be calibrated and a standard microphone are placed at the sound receiving position to record audio, and then the audio recorded by the equipment to be calibrated is compared with the audio recorded by the standard microphone to realize calibration; generally, the related technology can perform audio comparison in the frequency domain, calibrate the amplitude of the key frequency point to the amplitude of the corresponding frequency point of the standard equipment, and can also use simple time domain energy comparison. However, in practical applications, with the increase of intelligent devices, especially the increase of low-cost devices, however, due to the fact that the hardware difference of the home appliances is too large, the consistency of microphones is too poor, and with the increase of the number of products in the smart home environment, users cannot manually debug and calibrate each intelligent device in an off-line manner. Therefore, the following pickup calibration method is proposed in this embodiment to solve the technical problem that it is not possible to accurately determine which device of a plurality of devices is to be controlled by a voice command issued by a user.
Fig. 2 is a flow chart of a pickup calibration method according to an embodiment of the present invention, as shown in fig. 2, the method includes the following steps:
Step S202, acquiring a first audio and a second audio, wherein the first audio is obtained by a first microphone of a first device collecting wake-up voice sent by a standard device, and the second audio is obtained by a second microphone of a second device collecting wake-up voice sent by the standard device.
In this step, the first device, the second device, and the standard device may be smart devices in a smart home, where each of the first device and the second device has a function of responding to a voice of a user to perform an action. Alternatively, the wake-up speech may be emitted by a speaker of the standard device and captured by a microphone of each of the first device and the second device.
Step S204, determining a first energy value corresponding to the first audio and a second energy value corresponding to the second audio.
Alternatively, the first energy value and the second energy value in this step may be the total energy of the first audio and the second audio, respectively, or may be the time domain energy in the corresponding time domains in the first audio and the second audio. Since the first audio and the second audio are obtained by the first microphone and the second microphone respectively picking up wake-up speech of the standard device, the first audio and the second audio should correspond in the time domain. Accordingly, a time domain range may be determined, and further, a first energy value and a second energy value of the first audio and the second audio, respectively, within the time domain range may be determined, respectively.
Step S206, determining a first calibration coefficient corresponding to the first device and a second calibration coefficient corresponding to the second device based on the first energy value and the second energy value.
In this step, based on the energy value of the wake-up voice sent by the standard device and received by the first device and the second device, the difference in the pickup capacities of the microphones of the first device and the second device can be determined, so that the microphone pickup calibration method can be used for accurately performing pickup calibration on the first microphone and the second microphone.
And step S208, the pickup calibration is carried out on the first microphone and the second microphone according to the first calibration coefficient and the second calibration coefficient respectively.
The difference of pickup capability exists between the first microphone and the second microphone due to the difference of product models and use states, the energy value of the audio collected by the microphones is processed by adopting the first calibration coefficient, and the capability difference between the first microphone and the second microphone can be eliminated, so that the corrected microphones pick up on the same pickup level, for example, wake-up competition is carried out. Optionally, if the user utters a voice for waking up the device, the calibrated first device and second device both receive the voice uttered by the user through the microphone. Because the first microphone and the second microphone are calibrated by the first calibration coefficient and the second calibration coefficient respectively, the relative position difference and sounding direction difference of the user and the first device and the second device can be determined based on the difference of the energy value of the audio acquired by the first microphone and the energy value of the audio acquired by the second microphone, and the device is helped to accurately judge which device of the first device and the second device the target device wants to control.
In the above steps, the first audio and the second audio acquired by the first microphone and the second microphone of the second device are acquired, the first energy value and the second energy value are determined based on the first audio and the second audio, and the first calibration coefficient and the second calibration coefficient for pickup calibration of the first microphone of the first device and the second microphone of the second device are determined based on the first energy value and the second energy value, so that the purpose of calibrating pickup capacities of the first device and the second device in the smart home scene is achieved, the technical effect of providing a convenient device pickup calibration method in the smart home scene is achieved, and the technical problem that a convenient device pickup calibration method is absent in the smart home scene is solved.
As an alternative embodiment, the determining the first calibration coefficient corresponding to the first device and the second calibration coefficient corresponding to the second device based on the first energy value and the second energy value may comprise the following steps: acquiring third audio and fourth audio, wherein the third audio is audio obtained by collecting wake-up voice sent by a first device through a standard microphone of standard equipment, and the fourth audio is audio obtained by collecting wake-up voice sent by a second device through a standard microphone of the standard equipment; determining a third energy value corresponding to the third audio and a fourth energy value corresponding to the fourth audio; the first calibration coefficient is determined based on the first energy value and the third energy value, and the second calibration coefficient is determined based on the second energy value and the fourth energy value.
In this alternative embodiment, the first device and the second device may sequentially emit wake-up voices, receive the wake-up voices from a standard microphone of the standard device, and then determine the respective pickup capacities of the microphones of the first device and the second device based on the audio received by the standard microphone. In the method, the first microphone and the second microphone do not need to be calibrated, the distance from the first device to the standard device is not required to be measured, the distance from the loudspeaker of the standard device to the microphone of the first device is the same as the distance from the loudspeaker of the first device to the microphone of the standard device, so that the interference and attenuation of sound in the way can be considered to be the same in the two processes, and therefore, a first calibration coefficient used for representing the difference of the pickup capacity of the first microphone and the standard microphone and a second calibration coefficient can be determined through the first energy value and the third energy value. After the pickup capability difference of the first microphone and the second microphone relative to the standard microphone is determined, the pickup capability can be calculated instead, and the pickup capability difference between the first microphone and the second microphone is determined.
As an alternative embodiment, the first calibration factor and the second calibration factor may be determined by: determining a ratio of the third energy value to the first energy value as a first calibration coefficient; and determining a ratio of the fourth energy value and the second energy value as a second calibration coefficient.
Alternatively, the first calibration factor may take the form:
wherein, Representing a first calibration factor corresponding to the first device, d i representing the distance between the first device and the standard device (i.e., the distance between the first microphone and the standard microphone),Representing a first energy value determined from first audio acquired by a first microphone of a first device,Representing a third energy value corresponding to a third audio acquired by a standard microphone of the standard device. In case the pick-up capacities of the first microphone and the standard microphone are the same, the first energy value corresponding to the first audio and the third energy value corresponding to the third audio should be comparable, i.eHowever, when the pick-up capability of the first microphone is weaker than that of the standard microphone, there isAt this timeLess than 1. Thus, when the first microphone of the first device receives the user uttered speech, the following calibration may be performed on the speech energy value of the speech received by the first microphone based on the first calibration coefficient: calibrated speech energy value = speech energy value +.Likewise, the pick-up capability of the second microphone of the second device may be calibrated in the manner described above to obtain a second calibration factorBy the optional embodiment, a specific method for calibrating pickup capacities of the first microphone of the first device and the second microphone of the second device is provided, and by adopting the method, pickup capacity difference between the microphones of the first device and the second device can be correctly calibrated, so that the calibration effect is improved.
As an alternative embodiment, the method may further include: acquiring an online equipment list, wherein the online equipment list comprises a plurality of equipment; selecting one calibrated device in the online device list as a standard device, and selecting a first device and a second device in the online device list as devices to be calibrated; and sending wake-up word data to the standard equipment, wherein the wake-up word data is used for generating wake-up voice by the standard equipment.
The above-mentioned alternative embodiment may be applied to an intelligent gateway in an intelligent home scene, where the intelligent gateway obtains an online device list in a home through the internet or a local area network, and obtains calibration information about whether each device in the list has undergone pickup calibration. The intelligent gateway may then determine one calibrated device in the table as a standard device with which to calibrate other devices in the list of online devices that are not pickup calibrated, such as the first device and the second device, to achieve distributed pickup calibration of devices in the home without the aid of a specialized pickup calibration tool.
As an optional embodiment, the intelligent gateway may send wake-up word data to the first device and the second device, where the first device and the second device respond to wake-up voices sent by the standard device based on the wake-up word data, and call the first microphone and the second microphone to collect wake-up voices sent by the standard device to obtain the first audio and the second audio respectively.
As an alternative embodiment, the pickup calibration of the first microphone and the second microphone according to the first calibration factor and the second calibration factor, respectively, may be performed by: transmitting a first calibration coefficient to a first device, wherein the first device calibrates the energy value of the received audio according to the first calibration coefficient; and transmitting a second calibration coefficient to the second device, wherein the second device calibrates the energy value of the received audio according to the second calibration coefficient. After the intelligent gateway calculates the first calibration coefficient and the second calibration coefficient, the first calibration coefficient and the second calibration coefficient are issued to the first device and the second device, and the first microphone and the second microphone are calibrated relative to the standard microphone, which is equivalent to calibrating between the first microphone and the second microphone, so that quick calibration of pickup capacity between devices in a home scene is realized.
As an optional embodiment, obtaining a fifth audio, where the fifth audio is an audio obtained by the third microphone of the third device collecting the wake-up voice sent by the standard device; determining a fifth energy value corresponding to the fifth audio; determining a third correction coefficient corresponding to the third device based on the fifth energy value and any one of the first energy value and the second energy value; and carrying out pickup calibration on the third microphone according to the third correction coefficient. Alternatively, the third device may be another device in the online device list that is not pickup-calibrated, and by this embodiment, calibration of pickup capabilities of a plurality of devices in the home to the same level may be supported. At this time, when the user wakes up the devices in the home by sending out the wake-up word, each device performs wake-up competition based on the microphones with the same pick-up capability, and judges which device the user sends out the wake-up word to, so that the accuracy of the user for waking up the devices is improved, and the target object is prevented from waking up the wrong devices.
When the energy value is used as a judgment condition of the device that the user wants to control, the judgment can be performed according to the energy value received by the device. Specifically, when the target object is located between the first device and the second device, if the target object is closer to the first device, the energy value corresponding to the first audio received by the first microphone of the first device should be greater than the energy value of the second audio received by the second microphone of the second device. However, since the pick-up capability of the first microphone and the pick-up capability of the second microphone may be different, the direct comparison of the energy values of the first audio and the second audio may cause a problem of erroneous judgment. By adopting the optional embodiment, the problems can be solved by calibrating the pickup capacities of the first microphone and the second microphone, and further, the equipment corresponding to the larger energy value is determined to be the equipment to be awakened which is closer to the target object, so that the technical effect of improving the accuracy of judging the equipment to be awakened is achieved.
The steps can be actively performed by a user in a home, one intelligent device is selected as a standard device for pickup calibration in a home environment according to a certain rule, for example, all intelligent devices in the home are traversed, the device with the best pickup effect is used as the standard device, other intelligent devices in the home respectively perform pickup calibration with the standard device, and a calibration flow is executed to obtain a calibration coefficient corresponding to each device. According to the self-calibration method between the household intelligent devices, the technical problem that the target device for judging the voice pointing direction of the target object is inaccurate due to different pickup capacities among different household intelligent devices can be avoided, the calibration process of the pickup capacities of the intelligent devices can be simplified, the method for calibrating the pickup capacities of the intelligent devices in the home is provided for household users, and the efficiency of the pickup capacity calibration process is greatly improved.
According to an embodiment of the present invention, there is also provided a pickup calibration apparatus for implementing the above-described pickup calibration method, fig. 3 is a block diagram of a pickup calibration apparatus provided according to an embodiment of the present invention, and as shown in fig. 3, the pickup calibration apparatus 30 includes: the pickup calibration apparatus 30 is described below as an acquisition module 32, a first determination module 34, a second determination module 36, and a calibration module 38.
An obtaining module 32, configured to obtain a first audio and a second audio, where the first audio is an audio obtained by a first microphone of a first device collecting wake-up speech sent by a standard device, and the second audio is an audio obtained by a second microphone of a second device collecting the wake-up speech sent by the standard device;
A first determining module 34, coupled to the acquiring module 32, for determining a first energy value corresponding to the first audio and a second energy value corresponding to the second audio;
A second determining module 36, coupled to the first determining module 34, configured to determine a first calibration coefficient corresponding to the first device and a second calibration coefficient corresponding to the second device based on the first energy value and the second energy value;
And the calibration module 38 is connected to the second determination module 36, and is configured to perform pickup calibration on the first microphone and the second microphone according to the first calibration coefficient and the second calibration coefficient, respectively.
It should be noted that the above-mentioned obtaining module 32, the first determining module 34, the second determining module 36 and the calibrating module 38 correspond to steps S202 to S208 in the embodiment, and the plurality of modules are the same as the examples and application scenarios implemented by the corresponding steps, but are not limited to those disclosed in the above-mentioned embodiment.
Embodiments of the present invention may provide a computer device, optionally in this embodiment, the computer device may be located in at least one network device of a plurality of network devices of a computer network. The computer device includes a memory and a processor.
The memory may be used to store software programs and modules, such as program instructions/modules corresponding to the pickup calibration method and apparatus in the embodiments of the present invention, and the processor executes the software programs and modules stored in the memory, thereby executing various functional applications and data processing, that is, implementing the pickup calibration method described above. The memory may include high-speed random access memory, and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. In some examples, the memory may further include memory remotely located relative to the processor, which may be connected to the computer terminal via a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The processor may call the information and the application program stored in the memory through the transmission device to perform the following steps: acquiring first audio and second audio, wherein the first audio is obtained by collecting wake-up voice sent by standard equipment by a first microphone of first equipment, and the second audio is obtained by collecting the wake-up voice sent by the standard equipment by a second microphone of second equipment; determining a first energy value corresponding to the first audio and a second energy value corresponding to the second audio; determining a first calibration coefficient corresponding to the first device and a second calibration coefficient corresponding to the second device based on the first energy value and the second energy value; and respectively carrying out pickup calibration on the first microphone and the second microphone according to the first calibration coefficient and the second calibration coefficient.
Those of ordinary skill in the art will appreciate that all or part of the steps in the various methods of the above embodiments may be implemented by a program for instructing a terminal device to execute in association with hardware, the program may be stored in a computer readable storage medium, and the storage medium may include: flash disk, read-Only Memory (ROM), random-access Memory (Random Access Memory, RAM), magnetic disk or optical disk, etc.
Embodiments of the present invention also provide a computer-readable storage medium. Alternatively, in the present embodiment, the above-described computer-readable storage medium may be used to store the program code executed by the pickup calibration method provided in the above-described embodiment.
Alternatively, in this embodiment, the above-mentioned computer-readable storage medium may be located in any one of the computer terminals in the computer terminal group in the computer network, or in any one of the mobile terminals in the mobile terminal group.
Optionally, in the present embodiment, the computer readable storage medium is configured to store program code for performing the steps of: acquiring first audio and second audio, wherein the first audio is obtained by collecting wake-up voice sent by standard equipment by a first microphone of first equipment, and the second audio is obtained by collecting the wake-up voice sent by the standard equipment by a second microphone of second equipment; determining a first energy value corresponding to the first audio and a second energy value corresponding to the second audio; determining a first calibration coefficient corresponding to the first device and a second calibration coefficient corresponding to the second device based on the first energy value and the second energy value; and respectively carrying out pickup calibration on the first microphone and the second microphone according to the first calibration coefficient and the second calibration coefficient.
The foregoing embodiment numbers of the present invention are merely for the purpose of description, and do not represent the advantages or disadvantages of the embodiments.
In the foregoing embodiments of the present invention, the descriptions of the embodiments are emphasized, and for a portion of this disclosure that is not described in detail in this embodiment, reference is made to the related descriptions of other embodiments.
In the several embodiments provided in the present application, it should be understood that the disclosed technology may be implemented in other manners. The above-described embodiments of the apparatus are merely exemplary, and the division of units may be a logic function division, and there may be another division manner in actual implementation, for example, multiple units or components may be combined or integrated into another system, or some features may be omitted, or not performed. Alternatively, the coupling or direct coupling or communication connection shown or discussed with each other may be through some interfaces, units or modules, or may be in electrical or other forms.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed over a plurality of units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, each functional unit in the embodiments of the present invention may be integrated in one processing unit, or each unit may exist alone physically, or two or more units may be integrated in one unit. The integrated units may be implemented in hardware or in software functional units.
The integrated units, if implemented in the form of software functional units and sold or used as stand-alone products, may be stored in a non-volatile storage medium. Based on such understanding, the technical solution of the present invention may be embodied in essence or a part contributing to the prior art or all or part of the technical solution in the form of a software product stored in a storage medium, comprising several instructions for causing a computer device (which may be a personal computer, a server or a network device, etc.) to perform all or part of the steps of the method of the various embodiments of the present invention. And the aforementioned storage medium includes: a usb disk, a Read-Only Memory (ROM), a random access Memory (RAM, random Access Memory), a removable hard disk, a magnetic disk, or an optical disk, or other various media capable of storing program codes.
The foregoing is merely a preferred embodiment of the present application and it should be noted that modifications and adaptations to those skilled in the art may be made without departing from the principles of the present application, which are intended to be comprehended within the scope of the present application.

Claims (8)

1. A pickup calibration method, comprising:
Acquiring first audio and second audio, wherein the first audio is obtained by collecting wake-up voice sent by standard equipment by a first microphone of first equipment, and the second audio is obtained by collecting the wake-up voice sent by the standard equipment by a second microphone of second equipment;
determining a first energy value corresponding to the first audio and a second energy value corresponding to the second audio;
Determining a first calibration coefficient corresponding to the first device and a second calibration coefficient corresponding to the second device based on the first energy value and the second energy value;
performing pickup calibration on the first microphone and the second microphone according to the first calibration coefficient and the second calibration coefficient;
wherein the determining, based on the first energy value and the second energy value, a first calibration coefficient corresponding to the first device and a second calibration coefficient corresponding to the second device includes:
Acquiring third audio and fourth audio, wherein the third audio is audio obtained by collecting the wake-up voice sent by the first device through a standard microphone of the standard device, and the fourth audio is audio obtained by collecting the wake-up voice sent by the second device through the standard microphone of the standard device;
Determining a third energy value corresponding to the third audio and a fourth energy value corresponding to the fourth audio;
Determining the first calibration coefficient based on the first energy value and the third energy value, and determining the second calibration coefficient based on the second energy value and the fourth energy value;
wherein the determining the first calibration coefficient based on the first energy value and the third energy value, and the determining the second calibration coefficient based on the second energy value and the fourth energy value, comprises:
determining a ratio of the third energy value and the first energy value as the first calibration coefficient;
And determining a ratio of the fourth energy value and the second energy value as the second calibration coefficient;
wherein, according to the first calibration coefficient and the second calibration coefficient, pickup calibration is performed to the first microphone and the second microphone respectively, including:
In the case that the first microphone receives the first voice sent by the target object, the following formula is adopted: a=a-
A pickup calibration method, comprising:
Acquiring first audio and second audio, wherein the first audio is obtained by collecting wake-up voice sent by standard equipment by a first microphone of first equipment, and the second audio is obtained by collecting the wake-up voice sent by the standard equipment by a second microphone of second equipment;
determining a first energy value corresponding to the first audio and a second energy value corresponding to the second audio;
Determining a first calibration coefficient corresponding to the first device and a second calibration coefficient corresponding to the second device based on the first energy value and the second energy value;
performing pickup calibration on the first microphone and the second microphone according to the first calibration coefficient and the second calibration coefficient;
wherein the determining, based on the first energy value and the second energy value, a first calibration coefficient corresponding to the first device and a second calibration coefficient corresponding to the second device includes:
Acquiring third audio and fourth audio, wherein the third audio is audio obtained by collecting the wake-up voice sent by the first device through a standard microphone of the standard device, and the fourth audio is audio obtained by collecting the wake-up voice sent by the second device through the standard microphone of the standard device;
Determining a third energy value corresponding to the third audio and a fourth energy value corresponding to the fourth audio;
Determining the first calibration coefficient based on the first energy value and the third energy value, and determining the second calibration coefficient based on the second energy value and the fourth energy value;
wherein the determining the first calibration coefficient based on the first energy value and the third energy value, and the determining the second calibration coefficient based on the second energy value and the fourth energy value, comprises:
determining a ratio of the third energy value and the first energy value as the first calibration coefficient;
And determining a ratio of the fourth energy value and the second energy value as the second calibration coefficient;
wherein, according to the first calibration coefficient and the second calibration coefficient, pickup calibration is performed to the first microphone and the second microphone respectively, including:
In the case that the first microphone receives the first voice sent by the target object, the following formula is adopted: a=a- Calibrating a first speech energy value of the first speech, wherein A is the calibrated first speech energy value, a is the first speech energy value,For the first calibration coefficient to be the same,Is the distance between the first microphone and the standard microphone;
And, in case the second microphone receives the first voice uttered by the target object, passing through the formula e=e + Calibrating a second speech energy value of the first speech, wherein E is the calibrated second speech energy value, E is the second speech energy value,For the second calibration factor to be used,Is the distance between the second microphone and the standard microphone.
2. The method as recited in claim 1, further comprising:
acquiring an online equipment list, wherein the online equipment list comprises a plurality of equipment;
Selecting one calibrated device in the online device list as the standard device, and selecting the first device and the second device in the online device list as devices to be calibrated;
And sending wake-up word data to the standard equipment, wherein the wake-up word data is used for generating the wake-up voice by the standard equipment.
3. The method as recited in claim 2, further comprising: and sending the wake-up word data to the first device and the second device, wherein the first device and the second device respond to wake-up voice sent by the standard device based on the wake-up word data, and call the first microphone and the second microphone to collect the wake-up voice sent by the standard device to obtain the first audio and the second audio respectively.
4. The method of claim 2, wherein the calibrating the pickup of the first microphone and the second microphone based on the first calibration coefficient and the second calibration coefficient, respectively, comprises:
transmitting the first calibration coefficient to the first device, wherein the first device calibrates the energy value of the received audio according to the first calibration coefficient;
And transmitting the second calibration coefficient to the second device, wherein the second device calibrates the energy value of the received audio according to the second calibration coefficient.
5. The method as recited in claim 1, further comprising:
acquiring fifth audio, wherein the fifth audio is audio obtained by acquiring wake-up voice sent by the standard equipment by a third microphone of the third equipment;
Determining a fifth energy value corresponding to the fifth audio;
determining a third correction coefficient corresponding to the third device based on the fifth energy value and any one of the first energy value and the second energy value;
And carrying out pickup calibration on the third microphone according to the third correction coefficient.
6. A pickup calibration apparatus, comprising:
The system comprises an acquisition module, a first audio acquisition module and a second audio acquisition module, wherein the first audio is obtained by a first microphone of a first device acquiring wake-up voice sent by a standard device, and the second audio is obtained by a second microphone of a second device acquiring the wake-up voice sent by the standard device;
The first determining module is used for determining a first energy value corresponding to the first audio and a second energy value corresponding to the second audio;
A second determining module, configured to determine a first calibration coefficient corresponding to the first device and a second calibration coefficient corresponding to the second device based on the first energy value and the second energy value;
the calibration module is used for respectively carrying out pickup calibration on the first microphone and the second microphone according to the first calibration coefficient and the second calibration coefficient;
The second determining module is further configured to obtain a third audio and a fourth audio, where the third audio is an audio obtained by collecting the wake-up voice sent by the first device by a standard microphone of the standard device, and the fourth audio is an audio obtained by collecting the wake-up voice sent by the second device by a standard microphone of the standard device; determining a third energy value corresponding to the third audio and a fourth energy value corresponding to the fourth audio; determining the first calibration coefficient based on the first energy value and the third energy value, and determining the second calibration coefficient based on the second energy value and the fourth energy value;
The second determining module is further configured to determine, as the first calibration coefficient, a ratio of the third energy value and the first energy value; and determining a ratio of the fourth energy value and the second energy value as the second calibration coefficient;
The calibration module is further configured to, when the first microphone receives a first voice sent by the target object, pass through the formula: a=a- Calibrating a first speech energy value of the first speech, wherein A is the calibrated first speech energy value, a is the first speech energy value,For the first calibration coefficient to be the same,Is the distance between the first microphone and the standard microphone;
And, in case the second microphone receives the first voice uttered by the target object, passing through the formula e=e + Calibrating a second speech energy value of the first speech, wherein E is the calibrated second speech energy value, E is the second speech energy value,For the second calibration factor to be used,Is the distance between the second microphone and the standard microphone.
7. A computer-readable storage medium, characterized in that the computer-readable storage medium comprises a stored program, wherein the program when run performs the pickup calibration method of any one of claims 1 to 5.
8. An electronic device comprising a memory and a processor, wherein the memory has stored therein a computer program, the processor being arranged to perform the pickup calibration method of any one of claims 1 to 5 by means of the computer program.
CN202210631086.2A 2022-06-06 2022-06-06 Pickup calibration method and device, nonvolatile storage medium and electronic equipment Active CN115148203B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210631086.2A CN115148203B (en) 2022-06-06 2022-06-06 Pickup calibration method and device, nonvolatile storage medium and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210631086.2A CN115148203B (en) 2022-06-06 2022-06-06 Pickup calibration method and device, nonvolatile storage medium and electronic equipment

Publications (2)

Publication Number Publication Date
CN115148203A CN115148203A (en) 2022-10-04
CN115148203B true CN115148203B (en) 2024-09-24

Family

ID=83406156

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210631086.2A Active CN115148203B (en) 2022-06-06 2022-06-06 Pickup calibration method and device, nonvolatile storage medium and electronic equipment

Country Status (1)

Country Link
CN (1) CN115148203B (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113314098A (en) * 2020-02-27 2021-08-27 青岛海尔科技有限公司 Device calibration method and apparatus, storage medium, and electronic apparatus

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3713250B1 (en) * 2017-11-14 2023-04-05 Nippon Telegraph And Telephone Corporation Voice communication device, voice communication method, and program
CN108122563B (en) * 2017-12-19 2021-03-30 北京声智科技有限公司 Method for improving voice awakening rate and correcting DOA
EP3727134B8 (en) * 2017-12-22 2023-03-08 ResMed Sensor Technologies Limited Processor readable medium and corresponding method for health and medical sensing
CN109275084B (en) * 2018-09-12 2021-01-01 北京小米智能科技有限公司 Method, device, system, equipment and storage medium for testing microphone array
CN109451415A (en) * 2018-12-17 2019-03-08 深圳Tcl新技术有限公司 Microphone array auto-collation, device, equipment and storage medium
CN110223684A (en) * 2019-05-16 2019-09-10 华为技术有限公司 A kind of voice awakening method and equipment
JP6908142B1 (en) * 2020-01-27 2021-07-21 沖電気工業株式会社 Sound collecting device, sound collecting program, and sound collecting method
CN113808614A (en) * 2021-07-30 2021-12-17 北京声智科技有限公司 Sound energy value calibration and device wake-up method, device and storage medium

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113314098A (en) * 2020-02-27 2021-08-27 青岛海尔科技有限公司 Device calibration method and apparatus, storage medium, and electronic apparatus

Also Published As

Publication number Publication date
CN115148203A (en) 2022-10-04

Similar Documents

Publication Publication Date Title
CN115148203B (en) Pickup calibration method and device, nonvolatile storage medium and electronic equipment
CN115171703B (en) Distributed voice awakening method and device, storage medium and electronic device
CN114915514B (en) Method and device for processing intention, storage medium and electronic device
CN111131353B (en) Collaboration method and intelligent device group
CN115345225A (en) Method and device for determining recommended scene, storage medium and electronic device
CN118351842A (en) Voice interaction processing method and device, storage medium and electronic device
CN115734139A (en) Audio calibration method, receiving device, playing device and storage medium
CN116206618B (en) Equipment awakening method, storage medium and electronic device
CN112992137A (en) Voice interaction method and device, storage medium and electronic device
CN115314331B (en) Control method and device of intelligent terminal, storage medium and electronic device
CN118118282A (en) Audio playing method and device, storage medium and electronic device
CN116072124A (en) User identity recognition method, storage medium and electronic device
CN115171699B (en) Method and device for adjusting wake-up parameters, storage medium and electronic device
CN115148204B (en) Voice wakeup processing method and device, storage medium and electronic device
CN115001885B (en) Equipment control method and device, storage medium and electronic device
CN115419997B (en) Prompt message sending method and device, storage medium and electronic device
CN115312049B (en) Instruction response method, storage medium and electronic device
CN113793608B (en) Method and device for controlling intelligent household appliances through voice
CN117524231A (en) Voice person identification method, voice interaction method and device
CN118351632A (en) Voice prompt method and device for door lock, storage medium, electronic device and computer program product
CN114842842A (en) Voice interaction method and device of intelligent equipment and storage medium
CN117240874A (en) Equipment linkage method and device, storage medium and electronic device
CN116206618A (en) Equipment awakening method, storage medium and electronic device
CN117014246A (en) Control method of intelligent equipment, storage medium and electronic device
CN116403575A (en) Wake-free voice interaction method and device, storage medium and electronic device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant