CN110941737B

CN110941737B - Single-machine voice storage method, device and equipment and readable storage medium

Info

Publication number: CN110941737B
Application number: CN201911247024.6A
Authority: CN
Inventors: 彭辉; 黎智勇; 许敏强
Original assignee: Guangzhou Speakin Intelligent Technology Co ltd
Current assignee: Guangzhou Speakin Intelligent Technology Co ltd
Priority date: 2019-12-06
Filing date: 2019-12-06
Publication date: 2023-01-20
Anticipated expiration: 2039-12-06
Also published as: CN110941737A

Abstract

The application discloses a stand-alone voice storage method, device, equipment, and readable storage medium. The method includes collecting voice information and judging whether the uploading condition is satisfied; if the uploading condition is not satisfied, obtaining dimension information of the voice information; Store to the target storage partition pointed to by the dimension information in the collection terminal storage partition, and add the index mark of the voice information to the target storage partition; when it is detected that the voice information meets the preset upload condition, determine the target storage partition based on the index mark , upload the voice information and corresponding dimension information in the target bucket. After the voice information is collected, judge whether the uploading condition is met or not. According to the dimension information, find the target storage partition in the collection terminal, store the voice information and create an index mark. If the uploading condition is met later, according to the index mark, quickly Determine the target bucket and upload voice and dimension information.

Description

Stand-alone voice storage method, device, equipment and readable storage medium

技术领域technical field

本发明涉及数据存储技术领域，尤其涉及一种单机语音存储方法、装置、设备及可读介质。The invention relates to the technical field of data storage, in particular to a stand-alone voice storage method, device, equipment and readable medium.

背景技术Background technique

当公安部门采集到语音信息后，需要将语音信息上传到公安声纹库进行声纹比对，但是有时候受限于当前的条件状况无法直接将声纹上传，此时需要对于采集到的声纹信息进行存储，之后再将存储的声纹信息上传到公安声纹库中进行比对。After the public security department collects the voice information, it needs to upload the voice information to the public security voiceprint database for voiceprint comparison, but sometimes it is not possible to directly upload the voiceprint due to the current conditions. Store the voiceprint information, and then upload the stored voiceprint information to the public security voiceprint database for comparison.

现有的存储方法是将采集的语音信息随机存储到本地存储器中，当满足上传条件时，重新将采集的语音信息上传到公安声纹库，这种存储方法在再次尝试上传语音信息时查找需要上传的语音信息较为麻烦，或者直接上传所有的语音信息，这样会耗费较长时间。The existing storage method is to randomly store the collected voice information in the local memory, and when the upload condition is met, upload the collected voice information to the public security voiceprint database again. Uploading voice information is cumbersome, or directly uploading all voice information will take a long time.

发明内容Contents of the invention

本申请的主要目的在于提供一种单机语音存储方法、装置、设备及可读存储介质，旨在解决存储采集的语音信息后上传不便的问题。The main purpose of this application is to provide a stand-alone voice storage method, device, equipment and readable storage medium, aiming at solving the problem of inconvenient uploading after storing and collecting voice information.

为实现上述目的，本申请提供的一种单机语音存储方法，所述增量数据的聚类方法包括以下步骤：In order to achieve the above object, the application provides a stand-alone voice storage method, the incremental data clustering method includes the following steps:

采集语音信息，判断是否满足上传条件；Collect voice information and judge whether the uploading conditions are met;

若不满足上传条件，则获取所述语音信息的维度信息；If the uploading condition is not satisfied, acquiring dimension information of the voice information;

将所述语音信息存储到所述维度信息在采集终端存储分区中所指向的目标存储分区，并为所述存储分区添加所述语音信息的索引标记；storing the voice information in the target storage partition pointed to by the dimension information in the collection terminal storage partition, and adding an index mark of the voice information to the storage partition;

当检测到所述语音信息满足预设上传条件时，基于所述索引标记确定所述目标存储分区，将所述目标存储分区中所述语音信息和对应的所述维度信息上传。When it is detected that the voice information satisfies the preset upload condition, the target storage partition is determined based on the index mark, and the voice information and the corresponding dimension information in the target storage partition are uploaded.

可选地，所述采集语音信息的步骤包括：Optionally, the step of collecting voice information includes:

采集语音信息；Collect voice information;

分析所述语音信息的话者个数；analyzing the number of speakers of the voice information;

根据所述话者个数，将所述语音信息分离，得到各所述话者的语音信息。According to the number of speakers, the voice information is separated to obtain the voice information of each speaker.

可选地，所述判断是否能够满足上传条件的步骤包括：Optionally, the step of judging whether the upload condition can be met includes:

与公安声纹库建立连接；Establish a connection with the public security voiceprint database;

获取所述连接结果；Obtain the connection result;

若连接成功，则满足上传条件；If the connection is successful, the upload condition is met;

若连接失败，则不满足上传条件。If the connection fails, the upload condition is not met.

可选地，所述获取所述语音信息的维度信息的步骤包括：Optionally, the step of acquiring dimension information of the voice information includes:

确定所述语音信息的语音特征信息；determining voice feature information of the voice information;

确定所述语音信息的采集地点信息；Determining the collection location information of the voice information;

确定所述语音信息的采集时间信息；determining the collection time information of the voice information;

根据所述语音特征信息、所述采集地点信息和所述采集时间信息形成所述维度信息。The dimension information is formed according to the voice feature information, the collection location information and the collection time information.

可选地，所述将所述语音信息存储到所述维度信息在采集终端存储分区中所指向的目标存储分区，并为所述目标存储分区添加所述语音信息的索引标记的步骤包括：Optionally, the step of storing the voice information in the target storage partition pointed to by the dimension information in the collection terminal storage partition, and adding an index mark of the voice information to the target storage partition includes:

根据所述采集地点信息，找到采集终端中指向的所述目标存储分区；Find the target storage partition pointed to by the collection terminal according to the collection location information;

根据所述语音特征信息，将所述语音信息与所述目标存储分区中的存储信息进行相似度对比；performing a similarity comparison between the voice information and the stored information in the target storage partition according to the voice feature information;

若所述相似度大于预设值，则将所述语音信息存储到所述目标存储分区中；If the similarity is greater than a preset value, storing the voice information in the target storage partition;

根据所述采集时间信息，对所述目标存储分区添加所述索引标记。Add the index mark to the target storage partition according to the collection time information.

可选地，所述将所述语音信息与所述目标存储分区中的存储信息进行相似度对比的步骤之后包括：Optionally, after the step of comparing the similarity between the voice information and the stored information in the target storage partition includes:

若所述相似度不大于预设值，则按照所述采集地点信息，建立临时存储分区；If the similarity is not greater than a preset value, a temporary storage partition is established according to the collection location information;

将所述语音信息按照所述采集地点信息存储到所述临时存储分区中。The voice information is stored in the temporary storage partition according to the collection location information.

当所述临时存储分区中的语音信息数量超过预设数量，则将所述临时存储分区变为固定存储分区；When the amount of voice information in the temporary storage partition exceeds a preset number, then change the temporary storage partition into a fixed storage partition;

根据所述固定存储分区中语音信息的采集时间信息，对所述固定存储分区添加索引标记。An index mark is added to the fixed storage partition according to the collection time information of the voice information in the fixed storage partition.

可选地，所述基于所述索引标记确定所述目标存储分区，将所述目标存储分区中所述语音信息和对应的所述维度信息上传的步骤包括：Optionally, the step of determining the target storage partition based on the index mark, and uploading the voice information and the corresponding dimension information in the target storage partition includes:

获取上一次上传语音信息的上传时间；Obtain the upload time of the last uploaded voice message;

获取各存储分区的索引标记，根据所述上传时间，确定所述目标存储分区；Obtain the index mark of each storage partition, and determine the target storage partition according to the upload time;

根据所述目标存储分区中语音信息的采集时间信息，将所述所述目标存储分区中所述语音信息和对应的所述维度信息上传。Upload the voice information in the target storage partition and the corresponding dimension information according to the collection time information of the voice information in the target storage partition.

本申请还提供一种单机语音存储装置，所述单机语音存储装置包括：The present application also provides a stand-alone voice storage device, the stand-alone voice storage device comprising:

判断模块，用于采集语音信息，判断是否满足上传条件；Judging module, used for collecting voice information, judging whether the uploading condition is satisfied;

获取模块，用于若不满足上传条件，则获取所述语音信息的维度信息；An acquisition module, configured to acquire dimension information of the voice information if the uploading condition is not met;

存储模块，用于将所述语音信息存储到所述维度信息在采集终端存储分区中所指向的目标存储分区，并为所述目标存储分区添加所述语音信息的索引标记；A storage module, configured to store the voice information in a target storage partition pointed to by the dimension information in the collection terminal storage partition, and add an index mark of the voice information to the target storage partition;

上传模块，用于当检测到所述语音信息满足预设上传条件时，基于所述索引标记确定所述目标存储分区，将所述目标存储分区中所述语音信息和对应的所述维度信息上传。An upload module, configured to determine the target storage partition based on the index mark when it is detected that the voice information satisfies a preset upload condition, and upload the voice information and the corresponding dimension information in the target storage partition .

本申请还提供一种单机语音存储设备，所述单机语音存储设备包括：存储器、处理器及存储在所述存储器上并可在所述处理器上运行的单机语音存储程序，所述单机语音存储程序被所述处理器执行时实现如上述的单机语音存储方法的步骤。The present application also provides a stand-alone voice storage device. The stand-alone voice storage device includes: a memory, a processor, and a stand-alone voice storage program stored in the memory and operable on the processor. The stand-alone voice storage When the program is executed by the processor, the steps of the above-mentioned stand-alone voice storage method are realized.

本申请还提供一种可读存储介质，所述可读存储介质上存储有计算机程序，所述计算机程序被处理器执行时实现如上述的单机语音存储方法的步骤。The present application also provides a readable storage medium, on which a computer program is stored, and when the computer program is executed by a processor, the steps of the above-mentioned stand-alone voice storage method are implemented.

本申请通过采集语音信息，判断是否满足上传条件；若不满足上传条件，则获取所述语音信息的维度信息；将所述语音信息存储到所述维度信息在采集终端存储分区中所指向的目标存储分区，并为所述目标存储分区添加所述语音信息的索引标记；当检测到所述语音信息满足预设上传条件时，基于所述索引标记确定所述目标存储分区，将所述目标存储分区中所述语音信息和对应的所述维度信息上传。采集到语音信息后，判断能否直接上传到公安声纹库中，上传不了的语音信息，按照语音信息的维度信息，找到采集终端中的目标存储分区，将语音信息存储到目标存储分区中并建立索引标记，之后满足上传条件时，根据索引标记，快速确定目标存储分区并上传语音信息和维度信息，方便与公安声纹库中的声纹信息进行对比。This application collects the voice information to determine whether the uploading condition is met; if the uploading condition is not met, obtain the dimension information of the voice information; store the voice information in the target pointed to by the dimension information in the storage partition of the collection terminal storage partition, and add an index mark of the voice information for the target storage partition; when it is detected that the voice information satisfies a preset upload condition, determine the target storage partition based on the index mark, and store the target The voice information in the partition and the corresponding dimension information are uploaded. After the voice information is collected, determine whether it can be directly uploaded to the public security voiceprint database. If the voice information cannot be uploaded, according to the dimension information of the voice information, find the target storage partition in the collection terminal, store the voice information in the target storage partition and Create an index mark, and then when the uploading conditions are met, quickly determine the target storage partition according to the index mark and upload the voice information and dimension information, which is convenient for comparison with the voiceprint information in the public security voiceprint database.

附图说明Description of drawings

此处的附图被并入说明书中并构成本说明书的一部分，示出了符合本申请的实施例，并与说明书一起用于解释本申请的原理。The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the application and together with the description serve to explain the principles of the application.

为了更清楚地说明本申请实施例或现有技术中的技术方案，下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍，显而易见地，对于本领域普通技术人员而言，在不付出创造性劳动性的前提下，还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present application or the prior art, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, for those of ordinary skill in the art, In other words, other drawings can also be obtained from these drawings without paying creative labor.

图1为本申请实施例方案涉及的硬件运行环境的装置结构示意图；FIG. 1 is a schematic diagram of the device structure of the hardware operating environment involved in the solution of the embodiment of the present application;

图2为本申请单机语音存储方法第一实施例的流程示意图；Fig. 2 is the schematic flow chart of the first embodiment of the stand-alone voice storage method of the present application;

图3为本申请单机语音存储方法第二实施例中对于图2步骤S10的细化流程图；Fig. 3 is the refinement flowchart of step S10 in Fig. 2 in the second embodiment of the stand-alone voice storage method of the present application;

图4为本申请单机语音存储方法第三实施例中对于图2步骤S20的细化流程图；Fig. 4 is the refinement flowchart of step S20 in Fig. 2 in the third embodiment of the stand-alone voice storage method of the present application;

图5为本申请单机语音存储方法第四实施例中对于图2步骤S30的细化流程图；FIG. 5 is a detailed flow chart of step S30 in FIG. 2 in the fourth embodiment of the stand-alone voice storage method of the present application;

图6为本申请单机语音存储方法第四实施例中对于图2步骤S40的细化流程图；FIG. 6 is a detailed flow chart of step S40 in FIG. 2 in the fourth embodiment of the stand-alone voice storage method of the present application;

图7为本申请单机语音存储设备一实施例的系统结构示意图。FIG. 7 is a schematic diagram of the system structure of an embodiment of the stand-alone voice storage device of the present application.

本申请目的的实现、功能特点及优点将结合实施例，参照附图做进一步说明。The realization, functional features and advantages of the present application will be further described in conjunction with the embodiments and with reference to the accompanying drawings.

具体实施方式Detailed ways

应当理解，此处所描述的具体实施例仅仅用以解释本申请，并不用于限定本申请。It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

在后续的描述中，使用用于表示元件的诸如“模块”、“部件”或“单元”的后缀仅为了有利于本发明的说明，其本身没有特定的意义。因此，“模块”、“部件”或“单元”可以混合地使用。In the following description, use of suffixes such as 'module', 'part' or 'unit' for denoting elements is only for facilitating description of the present invention and has no specific meaning by itself. Therefore, 'module', 'part' or 'unit' may be used in combination.

如图1所示，图1是本申请实施例方案涉及的硬件运行环境的终端结构示意图。As shown in FIG. 1 , FIG. 1 is a schematic diagram of a terminal structure of a hardware operating environment involved in the solution of the embodiment of the present application.

本申请实施例终端为单机语音存储设备。The terminal in the embodiment of the present application is a stand-alone voice storage device.

如图1所示，该终端可以包括：处理器1001，例如CPU，网络接口1004，用户接口1003，存储器1005，通信总线1002。其中，通信总线1002用于实现这些组件之间的连接通信。用户接口1003可以包括显示屏(Display)、输入单元比如键盘(Keyboard)，可选用户接口1003还可以包括标准的有线接口、无线接口。网络接口1004可选的可以包括标准的有线接口、无线接口(如WI-FI接口)。存储器1005可以是高速RAM存储器，也可以是稳定的存储器(non-volatile memory)，例如磁盘存储器。存储器1005可选的还可以是独立于前述处理器1001的存储装置。As shown in FIG. 1 , the terminal may include: a processor 1001 , such as a CPU, a network interface 1004 , a user interface 1003 , a memory 1005 , and a communication bus 1002 . Wherein, the communication bus 1002 is used to realize connection and communication between these components. The user interface 1003 may include a display screen (Display), an input unit such as a keyboard (Keyboard), and the optional user interface 1003 may also include a standard wired interface and a wireless interface. Optionally, the network interface 1004 may include a standard wired interface and a wireless interface (such as a WI-FI interface). The memory 1005 can be a high-speed RAM memory, or a stable memory (non-volatile memory), such as a disk memory. Optionally, the memory 1005 may also be a storage device independent of the aforementioned processor 1001 .

可选地，终端还可以包括摄像头、RF(Radio Frequency，射频)电路，传感器、音频电路、WiFi模块等等。其中，传感器比如光传感器、运动传感器以及其他传感器。具体地，光传感器可包括环境光传感器及接近传感器，其中，环境光传感器可根据环境光线的明暗来调节显示屏的亮度，接近传感器可在终端设备移动到耳边时，关闭显示屏和/或背光。当然，终端设备还可配置陀螺仪、气压计、湿度计、温度计、红外线传感器等其他传感器，在此不再赘述。Optionally, the terminal may further include a camera, an RF (Radio Frequency, radio frequency) circuit, a sensor, an audio circuit, a WiFi module, and the like. Among them, sensors such as light sensors, motion sensors and other sensors. Specifically, the light sensor may include an ambient light sensor and a proximity sensor, where the ambient light sensor may adjust the brightness of the display screen according to the brightness of the ambient light, and the proximity sensor may turn off the display screen and/or backlight. Of course, the terminal device may also be configured with other sensors such as a gyroscope, a barometer, a hygrometer, a thermometer, and an infrared sensor, which will not be repeated here.

本领域技术人员可以理解，图1中示出的终端结构并不构成对终端的限定，可以包括比图示更多或更少的部件，或者组合某些部件，或者不同的部件布置。Those skilled in the art can understand that the terminal structure shown in FIG. 1 does not constitute a limitation on the terminal, and may include more or less components than those shown in the figure, or combine some components, or arrange different components.

如图1所示，作为一种计算机存储介质的存储器1005中可以包括操作系统、网络通信模块、用户接口模块以及单机语音存储程序。As shown in FIG. 1 , the memory 1005 as a computer storage medium may include an operating system, a network communication module, a user interface module, and a stand-alone voice storage program.

在图1所示的终端中，网络接口1004主要用于连接后台服务器，与后台服务器进行数据通信；用户接口1003主要用于连接客户端(用户端)，与客户端进行数据通信；而处理器1001可以用于调用存储器1005中存储的单机语音存储程序，并执行以下操作：In the terminal shown in Figure 1, the network interface 1004 is mainly used to connect to the background server and perform data communication with the background server; the user interface 1003 is mainly used to connect to the client (client) and perform data communication with the client; and the processor 1001 can be used to call the stand-alone voice storage program stored in memory 1005, and perform the following operations:

将所述语音信息存储到所述维度信息在采集终端存储分区中所指向的目标存储分区，并为所述目标存储分区添加所述语音信息的索引标记；storing the voice information in the target storage partition pointed to by the dimension information in the collection terminal storage partition, and adding an index mark of the voice information to the target storage partition;

基于上述终端硬件结构，提出本申请各个实施例。Based on the foregoing terminal hardware structure, various embodiments of the present application are proposed.

本申请提供一种单机语音存储方法。This application provides a stand-alone voice storage method.

参照图2，在单机语音存储方法第一实施例中，该方法包括：Referring to Fig. 2, in the first embodiment of stand-alone voice storage method, the method includes:

步骤S10，采集语音信息，判断是否满足上传条件；Step S10, collecting voice information, and judging whether the uploading condition is satisfied;

公安部门在进行案件调查时，需要采集语音信息，将采集到的语音信息上传到公安声纹库与声纹库中的声纹信息进行匹配对比，若匹配到相似的声纹信息，则说明采集的语音信息来源可能存在犯罪嫌疑。但是为了保证公安声纹库的信息安全，上传条件较为严苛，上传信息需要经过严格的安全性评估，上传语音信息需要检测上传环境，如上传地点、网络的流量状况等，若成功与公安声纹库建立连接，则上传语音信息，若不能与公安声纹库建立联系，则需要将语音信息保存在本地存储器中。When the public security department conducts a case investigation, it needs to collect voice information, upload the collected voice information to the public security voiceprint database and compare it with the voiceprint information in the voiceprint database. If similar voiceprint information is matched, it means that the collection The source of voice information may be suspected of crime. However, in order to ensure the information security of the public security voiceprint database, the uploading conditions are relatively strict. Uploading information needs to undergo strict security assessment. Uploading voice information needs to detect the uploading environment, such as the uploading location and network traffic conditions. If the connection is established with the voiceprint database, the voice information will be uploaded. If the connection with the public security voiceprint database cannot be established, the voice information needs to be saved in the local memory.

步骤S20，若不满足上传条件，则获取所述语音信息的维度信息；Step S20, if the uploading condition is not satisfied, then obtain dimension information of the voice information;

当不满足上传条件，获取所述语音信息的维度信息，语音信息的维度信息包括语音特征信息、采集地点信息和采集时间信息，语音特征信息包括语音的音高、音色等能对于语音进行分析比较的特征信息，采集地点信息是采集到语音信息的地点，采集时间信息是采集到语音信息的时间。When the uploading conditions are not met, the dimensional information of the voice information is obtained. The dimensional information of the voice information includes voice feature information, collection location information, and collection time information. The voice feature information includes voice pitch, timbre, etc., which can be analyzed and compared with the voice. The feature information of the collection location information is the place where the voice information is collected, and the collection time information is the time when the voice information is collected.

步骤S30，根据所述维度信息，将所述语音信息存储到所述维度信息在采集终端对应的存储分区中所指向的目标存储分区，并对为所述目标存储分区进行添加所述语音信息的索引标记；Step S30, according to the dimension information, store the voice information in the target storage partition pointed to by the dimension information in the storage partition corresponding to the collection terminal, and add the voice information to the target storage partition index mark;

终端中本身存储有部分从公安声纹库中导出的语音信息，从公安声纹库导出的语音信息按照采集时的地点信息进行分类，同时按照采集的时间先后顺序进行排序，对于新采集到的语音信息，根据维度信息中的采集地点信息先初步确定可能的目标存储分区，再利用语音特征信息与之前已经存储的采集终端中的语音信息进行匹配，如果有相似度较大的语音信息，则将采集到的语音信息存储到对应的目标存储分区中，并根据维度信息对于目标存储分区添加索引标记，如果没有找到合适的已有的存储分区，则按照语音信息的采集时间信息建立新的临时存储分区，将采集的语音信息存储到新的临时存储分区中。The terminal itself stores part of the voice information derived from the public security voiceprint database. The voice information derived from the public security voiceprint database is classified according to the location information at the time of collection, and sorted according to the time sequence of collection. For newly collected Voice information, according to the collection location information in the dimension information, first determine the possible target storage partitions, and then use the voice feature information to match the voice information in the collection terminal that has been stored before. If there is voice information with a high degree of similarity, then Store the collected voice information in the corresponding target storage partition, and add index marks to the target storage partition according to the dimension information. If no suitable existing storage partition is found, create a new temporary one according to the collection time information of the voice information The storage partition stores the collected voice information into a new temporary storage partition.

步骤S40，当能上传检测到所述语音信息满足预设上传条件时，基于所述索引从所述标记的确定所述目标存储分区中，根据所述维度信息，将所述目标存储分区中所述语音信息和对应的所述维度信息上传；Step S40, when it is detected that the voice information satisfies the preset upload condition, based on the index, from the target storage partition determined by the mark, according to the dimension information, upload all the voice information in the target storage partition Upload the above voice information and the corresponding dimension information;

当检测到当前环境重新满足预设上传条件时，则根据存储分区的索引标记，确定有进行过数据更新的目标存储分区，根据语音信息的维度信息中的采集时间信息，将新采集的语音信息和对应的维度信息上传到公安声纹库中。When it is detected that the current environment satisfies the preset upload condition again, according to the index mark of the storage partition, it is determined that there is a target storage partition that has undergone data update, and according to the collection time information in the dimension information of the voice information, the newly collected voice information And the corresponding dimension information is uploaded to the public security voiceprint database.

在本实施例中，通过采集语音信息，判断是否满足上传条件；若不满足上传条件，则获取所述语音信息的维度信息；将所述语音信息存储到所述维度信息在采集终端存储分区中所指向的目标存储分区，并为所述目标存储分区添加所述语音信息的索引标记；当检测到所述语音信息满足预设上传条件时，基于所述索引标记确定所述目标存储分区，将所述目标存储分区中所述语音信息和对应的所述维度信息上传。采集到语音信息后，判断能否直接上传到公安声纹库中，上传不了的语音信息，按照语音信息的维度信息，找到采集终端中的目标存储分区，将语音信息存储到目标存储分区中并建立索引标记，之后满足上传条件时，根据索引标记，快速确定目标存储分区并上传语音信息和维度信息，方便与公安声纹库中的声纹信息进行对比。In this embodiment, by collecting voice information, it is judged whether the uploading condition is satisfied; if the uploading condition is not satisfied, the dimension information of the voice information is obtained; the voice information is stored in the dimension information in the storage partition of the collection terminal The target storage partition pointed to, and add the index mark of the voice information for the target storage partition; when it is detected that the voice information meets the preset upload condition, determine the target storage partition based on the index mark, and add The voice information and the corresponding dimension information in the target storage partition are uploaded. After the voice information is collected, determine whether it can be directly uploaded to the public security voiceprint database. If the voice information cannot be uploaded, according to the dimension information of the voice information, find the target storage partition in the collection terminal, store the voice information in the target storage partition and Create an index mark, and then when the uploading conditions are met, quickly determine the target storage partition according to the index mark and upload the voice information and dimension information, which is convenient for comparison with the voiceprint information in the public security voiceprint database.

进一步地，参照图2和图3，在本申请单机语音存储方法第一实施例的基础上，提供单机语音存储方法第二实施例，在第二实施例中，Further, referring to FIG. 2 and FIG. 3 , on the basis of the first embodiment of the stand-alone voice storage method of the present application, a second embodiment of the stand-alone voice storage method is provided. In the second embodiment,

步骤S10包括：Step S10 includes:

步骤S11，采集语音信息；Step S11, collecting voice information;

通过指定的认证过的语音采集设备采集现场的语音信息，这样经过认证的采集设备采集的语音信息才能上传到公安声纹库中，同时采集时记录采集设备编号，采集人员编号、采集地点等信息。The on-site voice information is collected through the designated certified voice collection equipment, so that the voice information collected by the certified collection equipment can be uploaded to the public security voiceprint database, and the collection equipment number, collection personnel number, collection location and other information are recorded during collection. .

步骤S12，分析所述语音信息的话者个数；Step S12, analyzing the number of speakers of the voice information;

因为采集语音信息的环境比较复杂，通常会采集到多个话者混杂在一起的语音信息，通过采集设备的语音识别功能，识别采集到的语音信息，根据语音特征信息，将采集的语音信息进行聚类，聚类的聚类数就是话者个数。Because the environment for collecting voice information is relatively complicated, it usually collects voice information mixed with multiple speakers. Through the voice recognition function of the collection device, the collected voice information is recognized, and the collected voice information is processed according to the voice feature information. Clustering, the clustering number of clustering is the number of speakers.

步骤S13，根据所述话者个数，将所述语音信息分离，得到各所述话者的语音信息；Step S13, according to the number of the speakers, separate the voice information to obtain the voice information of each speaker;

根据识别的话者个数，将采集的语音信息进行分离，将每个话者对应的语音信息分别提取出来，同时将采集时间、采集地点等信息添加到每一个分离得到的语音信息中。According to the number of identified speakers, the collected voice information is separated, the voice information corresponding to each speaker is extracted separately, and information such as collection time and collection location is added to each separated voice information.

步骤S14，与公安声纹库建立连接；Step S14, establishing a connection with the public security voiceprint database;

公安声纹库存储有公安部门采集的犯罪嫌疑人的声纹信息，属于比较机密的信息，连接公安声纹库需要经过安全验证，采集语音信息后，通过连接公安声纹库，通过与公安声纹库中的信息进行比对判断是否存在相似的嫌疑人数据。The public security voiceprint database stores the voiceprint information of criminal suspects collected by the public security department, which is relatively confidential information. The connection to the public security voiceprint database requires security verification. The information in the tattoo database is compared to determine whether there is similar suspect data.

步骤S15，获取所述连接结果；Step S15, obtaining the connection result;

因为公安声纹库中保存的信息较为机密，因此对于与公安声纹库连接的环境有着更加严格的要求，如在网络不好传输受限的环境，或者处于境外等可能被窃取信息的网络环境的情况下，为了保证公安声纹库中的信息安全，在环境条件无法达到连接要求时，无法连接到公安声纹库中，所以在尝试连接到公安声纹库后获取连接结果。Because the information stored in the public security voiceprint database is relatively confidential, there are stricter requirements for the environment connected to the public security voiceprint database, such as in an environment where the network is not good and transmission is limited, or in a network environment where information may be stolen, such as overseas In this case, in order to ensure the information security in the public security voiceprint database, when the environmental conditions cannot meet the connection requirements, the public security voiceprint database cannot be connected, so after trying to connect to the public security voiceprint database, the connection result is obtained.

步骤S16，若连接成功，则满足上传条件；Step S16, if the connection is successful, the upload condition is met;

步骤S17，若连接失败，则不满足上传条件；Step S17, if the connection fails, the upload condition is not satisfied;

当连接成功，则自动上传语音信息，而连接失败时，无法上传语音信息，则需要将语音信息存储到本地存储器中等待之后上传。When the connection is successful, the voice information will be automatically uploaded, and when the connection fails, the voice information cannot be uploaded, and the voice information needs to be stored in the local storage and uploaded later.

在本实施例中，采集到语音信息，识别语音信息中包含的话者个数，将每个话者的语音信息进行分离，同时尝试连接到公安声纹库，判断与公安声纹库的连接结果，确定接下来的操作，连接成功，自动上传语音信息到公安声纹库，连接失败，则将语音信息存储到本地存储器中。In this embodiment, the voice information is collected, the number of speakers contained in the voice information is identified, the voice information of each speaker is separated, and at the same time, an attempt is made to connect to the public security voiceprint database, and the connection result with the public security voiceprint database is judged , to determine the next operation, if the connection is successful, the voice information will be automatically uploaded to the public security voiceprint database; if the connection fails, the voice information will be stored in the local memory.

进一步地，参照图2和图4，在本申请单机语音存储方法第二实施例的基础上，提供单机语音存储方法第三实施例，在第三实施例中，Further, referring to FIG. 2 and FIG. 4 , on the basis of the second embodiment of the stand-alone voice storage method of the present application, a third embodiment of the stand-alone voice storage method is provided. In the third embodiment,

步骤S20包括：Step S20 includes:

步骤S21，确定所述语音信息的语音特征信息；Step S21, determining the voice characteristic information of the voice information;

步骤S22，确定所述语音信息的采集地点信息；Step S22, determining the collection location information of the voice information;

步骤S23，确定所述语音信息的采集时间信息；Step S23, determining the collection time information of the voice information;

步骤S24，根据所述语音特征信息、所述采集地点信息和所述采集时间信息形成所述维度信息；Step S24, forming the dimension information according to the voice feature information, the collection location information and the collection time information;

语音特征信息由分析语音信息得到，语音特征信息包括语音的音高、振幅、频率等信息，采集地点是指采集到语音信息的地点、采集时间是采集到语音信息的时间。采集语音信息是为了判断现场是否有嫌疑人存在，将采集到的语音信息与公安声纹库中的信息进行对比。通常情况下，嫌疑人的活动范围会比较固定，容易出现在一些固定的地点，采集地点信息与采集时间信息可以构成语音信息的案件维度，即通过时间与地点判断案件可能相似的已有案件，再根据语音特征信息与公安声纹库中的可能相似的案件的采集到的语音信息进行对比，确认是否会是同一个人。The voice feature information is obtained by analyzing the voice information. The voice feature information includes the pitch, amplitude, frequency and other information of the voice. The collection location refers to the place where the voice information is collected, and the collection time refers to the time when the voice information is collected. The purpose of collecting voice information is to determine whether there is a suspect at the scene, and compare the collected voice information with the information in the public security voiceprint database. Usually, the range of activities of suspects is relatively fixed, and they tend to appear in some fixed places. The collection location information and collection time information can constitute the case dimension of voice information, that is, the existing cases that may be similar to the case are judged by time and place. Then, compare the voice feature information with the voice information collected from possibly similar cases in the public security voiceprint database to confirm whether it is the same person.

在本实施例中，分析采集到的语音信息，得到语音信息的维度信息，维度信息可用于与公安声纹库中的语音信息匹配，还可用于之后将语音信息存储在本地存储器中。In this embodiment, the collected voice information is analyzed to obtain dimension information of the voice information. The dimension information can be used to match the voice information in the public security voiceprint database, and can also be used to store the voice information in a local memory later.

进一步地，参照图2和图5，在本申请单机语音存储方法第三实施例的基础上，提供单机语音存储方法第四实施例，在第四实施例中，Further, referring to FIG. 2 and FIG. 5 , on the basis of the third embodiment of the stand-alone voice storage method of the present application, a fourth embodiment of the stand-alone voice storage method is provided. In the fourth embodiment,

步骤S30包括：Step S30 includes:

步骤S31，根据所述采集地点信息，找到采集终端中指向的所述目标存储分区；Step S31, according to the collection location information, find the target storage partition pointed to by the collection terminal;

终端中本身存储有已经从公安声纹库中导出的部分语音信息，从公安声纹库中导出的语音信息是按照采集到该语音信息的地点信息进行分区存储，采集地点的划分可以以市作为划分范围，也可以以省作为划分范围，即存储分区的划分可以按照实际情况进行调整，采集到新的语音信息后，先根据语音信息的采集地点信息，初步确定目标存储分区。The terminal itself stores part of the voice information that has been exported from the public security voiceprint database. The voice information derived from the public security voiceprint database is stored in partitions according to the location information where the voice information was collected. The collection location can be divided into cities as The scope of division can also be divided into provinces, that is, the division of storage partitions can be adjusted according to the actual situation. After new voice information is collected, the target storage partition is initially determined according to the location information of the voice information.

步骤S32，根据所述语音特征信息，将所述语音信息与所述目标存储分区中的存储信息进行相似度对比；Step S32, comparing the similarity between the voice information and the stored information in the target storage partition according to the voice feature information;

根据采集地点信息，初步确定语音信息可能的目标存储分区后，根据语音特征信息，确定是否存在与采集到的语音信息相似的已有语音信息，如果有相似的已有语音信息，则将语音信息存储到对应的目标存储分区中，之后在上传时可按照现在语音信息所在的存储分区将语音信息上传到公安声纹库中对应的存储分区中。According to the collection location information, after preliminarily determining the possible target storage partition of the voice information, according to the voice feature information, determine whether there is existing voice information similar to the collected voice information, if there is similar existing voice information, the voice information Store in the corresponding target storage partition, and then upload the voice information to the corresponding storage partition in the public security voiceprint database according to the current storage partition where the voice information is located when uploading.

步骤S33，若所述相似度大于预设值，则将所述语音信息存储到所述目标存储分区中；Step S33, if the similarity is greater than a preset value, storing the voice information in the target storage partition;

通过采集的语音信息与存储的语音信息的相似度比较，如果相似程度较高，则存储到对应的目标存储分区中。By comparing the similarity between the collected voice information and the stored voice information, if the similarity is high, it is stored in the corresponding target storage partition.

步骤S34，根据所述采集时间信息，对所述目标存储分区添加所述索引标记；Step S34, adding the index mark to the target storage partition according to the collection time information;

对于进行过数据更新的存储分区，根据存储到存储分区中的语音信息的采集时间信息，对存储分区添加索引标记，索引标记可以记录存储分区中最后更新的语音信息的采集时间。For the storage partition that has undergone data update, according to the collection time information of the voice information stored in the storage partition, an index mark is added to the storage partition, and the index mark can record the collection time of the last updated voice information in the storage partition.

步骤S35，若所述相似度不大于预设值，则按照所述采集地点信息，建立临时存储分区；Step S35, if the similarity is not greater than a preset value, then according to the collection location information, a temporary storage partition is established;

相似度较小，说明该语音信息与目前存储的语音信息关联较小，无法直接放入相应的存储分区中，则按照采集地点信息，再次建立新的临时存储分区存储语音信息，对于新的临时存储分区，语音信息的采集时间信息与存储分区中已有的语音信息不同。If the similarity is small, it means that the voice information is less related to the currently stored voice information and cannot be directly put into the corresponding storage partition. Then, according to the collection location information, a new temporary storage partition is established again to store the voice information. For the new temporary In the storage partition, the collection time information of the voice information is different from the existing voice information in the storage partition.

步骤S36，将所述语音信息按照所述采集地点信息存储到所述临时存储分区中；Step S36, storing the voice information in the temporary storage partition according to the collection location information;

如果采集的语音信息的采集地点信息没有对应的临时存储分区，则建立新的临时存储分区并将语音信息存储到新的临时存储分区中，若已经存在相同采集地点信息的临时存储分区，则直接将所述语音信息存储到对应已有的存储分区中。If the collection location information of the collected voice information does not have a corresponding temporary storage partition, then create a new temporary storage partition and store the voice information in the new temporary storage partition, if there is already a temporary storage partition with the same collection location information, then directly The voice information is stored in a corresponding existing storage partition.

步骤S37，当所述临时存储分区中的语音信息数量超过预设数量，则将所述临时存储分区变为固定存储分区；Step S37, when the number of voice information in the temporary storage partition exceeds the preset number, then change the temporary storage partition into a fixed storage partition;

当临时存储分区中的语音信息较多时，说明临时存储分区中的语音信息可能已经构成了新的案件的相关语音信息，将临时存储分区变为固定存储分区，之后采集的语音信息如果符合条件，可以存储到这个新的固定存储分区中。When there is a lot of voice information in the temporary storage partition, it means that the voice information in the temporary storage partition may have constituted the relevant voice information of the new case, and the temporary storage partition becomes a fixed storage partition. If the voice information collected later meets the conditions, can be stored into this new pinned bucket.

步骤S38，根据所述固定存储分区中语音信息的采集时间信息，对所述固定存储分区添加索引标记；Step S38, adding an index mark to the fixed storage partition according to the collection time information of the voice information in the fixed storage partition;

同样地，新的固定存储分区中也会添加索引标记，与前面的索引标记一样，新的固定存储分区的索引标记包含该存储分区最后一次更新的语音信息的采集时间信息。Similarly, an index mark will also be added to the new fixed storage partition. Like the previous index mark, the index mark of the new fixed storage partition contains the collection time information of the last updated voice information of the storage partition.

在本实施例中，对于采集到的语音信息，如果能够存储到已有的目标存储分区中，则保存到目标存储分区中并添加索引标记；无法存储到已有的存储分区中，则建立新的临时存储分区并将采集的语音信息存储到临时存储分区中。In this embodiment, if the voice information collected can be stored in an existing target storage partition, then it is stored in the target storage partition and an index mark is added; if it cannot be stored in an existing storage partition, a new one is created. and store the collected voice information in the temporary storage partition.

进一步地，参照图2和图6，在本申请单机语音存储方法第四实施例的基础上，提供单机语音存储方法第五实施例，在第五实施例中，Further, referring to FIG. 2 and FIG. 6 , on the basis of the fourth embodiment of the stand-alone voice storage method of the present application, a fifth embodiment of the stand-alone voice storage method is provided. In the fifth embodiment,

步骤S40包括：Step S40 includes:

步骤S41，获取上一次上传语音信息的上传时间；Step S41, obtaining the upload time of the last uploaded voice message;

采集终端会自动记录每一次的上传操作，在每一次上传语音信息时，都会把两次上传间隔中新采集的语音信息上传到公安声纹库中，因此每一次上传时，需要获取上一次上传语音信息的上传时间。The collection terminal will automatically record each upload operation. When uploading voice information each time, it will upload the newly collected voice information in the interval between two uploads to the public security voiceprint database. Therefore, each upload needs to obtain the last upload. The upload time of the audio message.

步骤S42，获取各存储分区的索引标记，根据所述上传时间，确定所述目标存储分区；Step S42, obtaining the index mark of each storage partition, and determining the target storage partition according to the upload time;

当存储分区的索引标记中数据更新时间晚于上一次的上传时间，则确定该目标存储分区需要进行语音信息的上传。When the update time of the data in the index mark of the storage partition is later than the last upload time, it is determined that the target storage partition needs to upload voice information.

步骤S43，根据所述目标存储分区中语音信息的采集时间信息，将所述所述目标存储分区中所述语音信息和对应的所述维度信息上传；Step S43, uploading the voice information in the target storage partition and the corresponding dimension information according to the collection time information of the voice information in the target storage partition;

确定目标存储分区后，获取目标存储分区中各语音信息的采集时间信息，若采集时间晚于上一次上传数据的上传时间，则将语音信息和对应的维度信息上传到公安声纹库中，同时根据维度信息，将语音信息存储到公安声纹库对应的区域中。After the target storage partition is determined, the collection time information of each voice information in the target storage partition is obtained. If the collection time is later than the upload time of the last uploaded data, the voice information and the corresponding dimension information are uploaded to the public security voiceprint database, and at the same time According to the dimension information, the voice information is stored in the corresponding area of the public security voiceprint database.

在本实施例中，根据存储分区的索引标记，快速确定目标存储分区，在根据语音信息的维度信息，确定相应上传的语音信息，将语音信息与对应的维度信息上传。In this embodiment, the target storage partition is quickly determined according to the index mark of the storage partition, and the corresponding uploaded voice information is determined according to the dimension information of the voice information, and the voice information and the corresponding dimension information are uploaded.

此外，参照图7，本申请实施例还提出一种单机语音存储装置，所述单机语音存储装置包括：In addition, referring to FIG. 7, the embodiment of the present application also proposes a stand-alone voice storage device, the stand-alone voice storage device includes:

可选地，所述判断模块还用于：Optionally, the judging module is also used for:

采集语音信息；Collect voice information;

获取所述连接结果；Obtain the connection result;

可选地，所述获取模块还用于：Optionally, the acquisition module is also used for:

可选地，所述存储模块还用于：Optionally, the storage module is also used for:

将所述语音信息按照所述采集地点信息存储到所述临时存储分区中；storing the voice information in the temporary storage partition according to the collection location information;

可选地，所述上传模块还用于：Optionally, the upload module is also used for:

本申请设备和可读存储介质(即计算机可读存储介质)的具体实施方式的拓展内容与上述单机语音存储方法各实施例基本相同，在此不做赘述。The extended content of the specific implementation of the device and the readable storage medium (that is, the computer-readable storage medium) of the present application is basically the same as the above-mentioned embodiments of the stand-alone voice storage method, and will not be repeated here.

需要说明的是，在本文中，术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含，从而使得包括一系列要素的过程、方法、物品或者装置不仅包括那些要素，而且还包括没有明确列出的其他要素，或者是还包括为这种过程、方法、物品或者装置所固有的要素。在没有更多限制的情况下，由语句“包括一个……”限定的要素，并不排除在包括该要素的过程、方法、物品或者装置中还存在另外的相同要素。It should be noted that, in this document, the term "comprising", "comprising" or any other variation thereof is intended to cover a non-exclusive inclusion such that a process, method, article or apparatus comprising a set of elements includes not only those elements, It also includes other elements not expressly listed, or elements inherent in the process, method, article, or device. Without further limitations, an element defined by the phrase "comprising a ..." does not preclude the presence of additional identical elements in the process, method, article, or apparatus comprising that element.

上述本发明实施例序号仅仅为了描述，不代表实施例的优劣。The serial numbers of the above embodiments of the present invention are for description only, and do not represent the advantages and disadvantages of the embodiments.

通过以上的实施方式的描述，本领域的技术人员可以清楚地了解到上述实施例方法可借助软件加必需的通用硬件平台的方式来实现，当然也可以通过硬件，但很多情况下前者是更佳的实施方式。基于这样的理解，本发明的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来，该计算机软件产品存储在一个存储介质(如ROM/RAM、磁碟、光盘)中，包括若干指令用以使得一台终端(可以是手机，计算机，服务器，空调器，或者网络设备等)执行本发明各个实施例所述的方法。Through the description of the above embodiments, those skilled in the art can clearly understand that the methods of the above embodiments can be implemented by means of software plus a necessary general-purpose hardware platform, and of course also by hardware, but in many cases the former is better implementation. Based on such an understanding, the essence of the technical solution of the present invention or the part that contributes to the prior art can be embodied in the form of software products, and the computer software products are stored in a storage medium (such as ROM/RAM, disk, CD) contains several instructions to make a terminal (which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) execute the methods described in various embodiments of the present invention.

上面结合附图对本发明的实施例进行了描述，但是本发明并不局限于上述的具体实施方式，上述的具体实施方式仅仅是示意性的，而不是限制性的，本领域的普通技术人员在本发明的启示下，在不脱离本发明宗旨和权利要求所保护的范围情况下，还可做出很多形式，这些均属于本发明的保护之内。Embodiments of the present invention have been described above in conjunction with the accompanying drawings, but the present invention is not limited to the above-mentioned specific implementations, and the above-mentioned specific implementations are only illustrative, rather than restrictive, and those of ordinary skill in the art will Under the enlightenment of the present invention, many forms can also be made without departing from the gist of the present invention and the protection scope of the claims, and these all belong to the protection of the present invention.

Claims

1. a stand-alone voice storage method is characterized in that, the stand-alone voice storage method comprises the following steps:

Collect voice information and judge whether the uploading conditions are met;

If the uploading condition is not satisfied, acquiring dimension information of the voice information;

storing the voice information in the target storage partition pointed to by the dimension information in the collection terminal storage partition, and adding an index mark of the voice information to the target storage partition;

When it is detected that the voice information satisfies the preset upload condition, determine the target storage partition based on the index mark, and upload the voice information and the corresponding dimension information in the target storage partition;

Wherein, the step of acquiring dimension information of the voice information includes:

determining voice feature information of the voice information;

Determining the collection location information of the voice information;

determining the collection time information of the voice information;

The dimension information is formed according to the voice feature information, the collection location information and the collection time information.

2. stand-alone voice storage method as claimed in claim 1, is characterized in that, the step of described collection voice information comprises:

Collect voice information;

analyzing the number of speakers of the voice information;

According to the number of speakers, the voice information is separated to obtain the voice information of each speaker.

3. stand-alone voice storage method as claimed in claim 2, is characterized in that, the step of described judgment whether satisfies upload condition comprises:

Establish a connection with the public security voiceprint database;

Get the connection result;

If the connection is successful, the upload condition is met;

If the connection fails, the upload condition is not met.

4. The stand-alone voice storage method according to claim 1, wherein the voice information is stored in the target storage partition pointed to by the dimension information in the collection terminal storage partition, and stored for the target The step of adding the index mark of the voice information to the partition comprises:

Find the target storage partition pointed to by the collection terminal according to the collection location information;

performing a similarity comparison between the voice information and the stored information in the target storage partition according to the voice feature information;

If the similarity is greater than a preset value, storing the voice information in the target storage partition;

Add the index mark to the target storage partition according to the collection time information.

5. stand-alone voice storage method as claimed in claim 4, is characterized in that, described voice information and the storage information in the described target storage partition are carried out after the step of similarity comparison comprising:

If the similarity is not greater than a preset value, a temporary storage partition is established according to the collection location information;

storing the voice information in the temporary storage partition according to the collection location information;

When the amount of voice information in the temporary storage partition exceeds a preset number, then change the temporary storage partition into a fixed storage partition;

An index mark is added to the fixed storage partition according to the collection time information of the voice information in the fixed storage partition.

6. The stand-alone voice storage method according to claim 5, wherein the target storage partition is determined based on the index mark, and the voice information in the target storage partition and the corresponding dimension information The upload steps include:

Obtain the upload time of the last uploaded voice message;

Obtain the index mark of each storage partition, and determine the target storage partition according to the upload time;

Upload the voice information and the corresponding dimension information in the target storage partition according to the collection time information of the voice information in the target storage partition.

7. A stand-alone voice storage device, characterized in that, the stand-alone voice storage device comprises:

Judging module, used for collecting voice information, judging whether the uploading condition is satisfied;

An acquisition module, configured to acquire dimension information of the voice information if the uploading condition is not met;

A storage module, configured to store the voice information in a target storage partition pointed to by the dimension information in the collection terminal storage partition, and add an index mark of the voice information to the target storage partition;

An upload module, configured to determine the target storage partition based on the index mark when it is detected that the voice information satisfies a preset upload condition, and upload the voice information and the corresponding dimension information in the target storage partition ;

The acquisition module is also used to determine the voice feature information of the voice information; determine the collection location information of the voice information; determine the collection time information of the voice information; according to the voice feature information, the collection location information and the collection time information to form the dimension information.

8. A stand-alone voice storage device, characterized in that the device comprises: a memory, a processor and a stand-alone voice storage program stored on the memory and operable on the processor, the stand-alone voice storage program When being executed by the processor, the steps of realizing the stand-alone voice storage method according to any one of claims 1 to 6 are realized.

9. A readable storage medium, characterized in that a computer program is stored on the readable storage medium, and when the computer program is executed by a processor, the stand-alone voice according to any one of claims 1 to 6 is realized The steps of the storage method.