CN107181849A

CN107181849A - The way of recording and device

Info

Publication number: CN107181849A
Application number: CN201710256963.1A
Authority: CN
Inventors: 伍亮雄; 刘鸣; 骆艳飞
Original assignee: Beijing Xiaomi Mobile Software Co Ltd
Current assignee: Beijing Xiaomi Mobile Software Co Ltd
Priority date: 2017-04-19
Filing date: 2017-04-19
Publication date: 2017-09-19

Abstract

The disclosure is directed to the way of recording and device.This method includes：Whether during being recorded to audio signal, detect includes the crucial audio signal fragment for meeting preparatory condition in the audio signal；Detect when the audio signal includes the crucial audio signal fragment for meeting preparatory condition, the crucial audio signal fragment is marked in recording file.The technical scheme can be during recording to audio signal, the crucial audio signal fragment for meeting preparatory condition is marked in recording file, user can be quickly found out the corresponding recording of mark by being marked in recording file, it is simple to operate so as to obtain the content of user's care；Labeling process is intelligent, it is not necessary to which user participates in, and does not interfere with communication effect, so, it is possible to improve Consumer's Experience.

Description

Recording method and device

Technical Field

The disclosure relates to the technical field of terminal information processing, and in particular relates to a recording method and device.

Background

At present, most of mobile phone dialing software supports a call recording function, when a user needs to find a telephone number from the whole call recording, the user firstly finds an approximate time point, and then listens to the telephone number, so that the operation is quite complex, time is consumed, and the efficiency is low.

In the related art, when a user needs to record a telephone number in a calling process, the user records the telephone number by using a notebook or a memo of a mobile phone or records the telephone number by calling a dialing page to input the telephone number under the condition of not hanging up the telephone.

Disclosure of Invention

In order to overcome the problems in the related art, embodiments of the present disclosure provide a recording method and apparatus. The technical scheme is as follows:

according to a first aspect of the embodiments of the present disclosure, there is provided a sound recording method, including:

detecting whether a key audio signal fragment meeting a preset condition is included in an audio signal or not in the process of recording the audio signal;

and marking the key audio signal segments in the sound recording file when detecting that the audio signals comprise the key audio signal segments meeting the preset conditions.

The technical scheme provided by the embodiment of the disclosure can have the following beneficial effects: according to the technical scheme, in the process of recording the audio signals, the key audio signal fragments meeting the preset conditions are marked in the recording file, and a user can quickly find the recording corresponding to the mark in the recording file through the mark, so that the content concerned by the user is obtained, and the operation is simple and convenient; the marking process is intelligent, does not need user participation, does not influence the conversation effect, and thus, the user experience can be improved.

In one embodiment, the detecting whether the key audio signal segment meeting the preset condition is included in the audio signal includes:

detecting whether the audio signal comprises continuous key audio signal sub-segments with preset number or not; the key audio signal sub-segment is matched with a sound sample in a sound sample set acquired in advance;

when the audio signal comprises a continuous preset number of key audio signal sub-segments, determining that the audio signal comprises key audio signal segments meeting preset conditions; the key audio signal segments comprise the preset number of consecutive key audio signal sub-segments; or,

when the audio signal does not include continuous key audio signal sub-segments with preset number, determining that the key audio signal segments meeting preset conditions are not included in the audio signal.

The technical scheme provided by the embodiment of the disclosure can have the following beneficial effects: according to the technical scheme, whether the audio signals comprise the key audio signal segments meeting the preset conditions or not is determined by detecting whether the audio signals comprise the continuous key audio signal sub-segments with the preset number or not, the accuracy of determining the key audio signal segments is improved, the key audio signal segments meeting the preset conditions are marked in the recording file on the basis, and the accuracy of marking can be improved.

In one embodiment, said tagging said key audio signal segments in a sound recording file comprises:

adding first marking information in time information corresponding to the key audio signal segments in the sound recording file; the first marking information is used for marking the time information corresponding to the key audio signal segment on a playing progress bar according to the first marking information when the sound recording file is played.

The technical scheme provided by the embodiment of the disclosure can have the following beneficial effects: according to the technical scheme, first mark information is added to time information corresponding to key audio signal segments in a recording file, after recording is finished, when the recording is played through a terminal or other playing equipment, the terminal or other playing equipment marks the time information corresponding to the key audio signal segments on a playing progress bar according to the first mark information in the recording file, for example, mark symbols such as flags are displayed on or beside the time corresponding to the key audio signal segments on the playing progress bar; the user can directly position the playing position where the key audio signal clip is played through the mark, and the marked recording can be quickly found.

acquiring key information carried by the key audio signal fragment;

and recording the key information into a sound recording file.

The technical scheme provided by the embodiment of the disclosure can have the following beneficial effects: according to the technical scheme, the key information is recorded in the recording file independently, and when the recording file is played through the terminal or other playing equipment after the recording is finished, the terminal or other playing equipment can directly display the key information in a character form on the screen of the terminal or other playing equipment according to the key information recorded in the recording file, so that a user can obtain the information conveniently, the recording playback process is omitted, and the efficiency is improved.

In one embodiment, the method further comprises:

acquiring time information corresponding to the key audio signal fragment in the sound recording file;

and recording the acquired time information into the sound recording file.

The technical scheme provided by the embodiment of the disclosure can have the following beneficial effects: according to the technical scheme, the key audio signal fragments meeting the preset conditions are marked in the recording file, furthermore, the terminal also independently records the time information corresponding to the key audio signal fragments in the recording file, the time information is directly displayed on a terminal screen when the recording is played, and a user can accurately find the corresponding recording position according to the time information, so that the user experience is improved.

According to a second aspect of the embodiments of the present disclosure, there is provided a sound recording apparatus including:

the detection module is used for detecting whether a key audio signal fragment meeting a preset condition is included in the audio signal or not in the process of recording the audio signal;

and the marking module is used for marking the key audio signal segments in the recording file when detecting that the key audio signal segments meeting the preset conditions are included in the audio signals.

In one embodiment, the detection module includes:

the first detection submodule is used for detecting whether the audio signal comprises a continuous preset number of key audio signal sub-segments; the key audio signal sub-segment is matched with a sound sample in a sound sample set acquired in advance;

the first determining submodule is used for determining that the audio signal comprises key audio signal segments meeting preset conditions when the audio signal comprises a continuous preset number of key audio signal sub-segments; the key audio signal segments comprise the preset number of consecutive key audio signal sub-segments;

a second determining submodule, configured to determine that a key audio signal segment meeting a preset condition is not included in the audio signal when a consecutive preset number of key audio signal sub-segments are not included in the audio signal.

In one embodiment, the marking module adds first marking information to time information corresponding to the key audio signal segments in the sound recording file; the first marking information is used for marking the time information corresponding to the key audio signal segment on a playing progress bar according to the first marking information when the sound recording file is played.

In one embodiment, the tagging module comprises:

the acquisition submodule is used for acquiring key information carried by the key audio signal fragment;

and the recording submodule is used for recording the key information into a sound recording file.

In one embodiment, the apparatus further comprises:

the acquisition module is used for acquiring the time information corresponding to the key audio signal fragment in the sound recording file;

and the recording module is used for recording the acquired time information into the sound recording file.

According to a third aspect of the embodiments of the present disclosure, there is provided a sound recording apparatus including:

a processor;

a memory for storing processor-executable instructions;

wherein the processor is configured to:

It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the disclosure.

Drawings

The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the present disclosure and together with the description, serve to explain the principles of the disclosure.

FIG. 1 is a flow diagram illustrating a recording method according to an example embodiment.

FIG. 2 is a flowchart illustrating a step 101 in a recording method according to an example embodiment.

FIG. 3 is a flowchart illustrating step 102 of a recording method according to an example embodiment.

FIG. 4 is a flowchart illustrating a recording method according to an example embodiment.

FIG. 5 is a flowchart illustrating a recording method according to an example embodiment.

FIG. 6 is a block diagram illustrating a sound recording apparatus according to an example embodiment.

FIG. 7 is a block diagram illustrating a sound recording apparatus according to an example embodiment.

FIG. 8 is a block diagram illustrating a sound recording apparatus according to an example embodiment.

FIG. 9 is a block diagram illustrating a sound recording apparatus according to an example embodiment.

FIG. 10 is a block diagram illustrating a sound recording apparatus according to an example embodiment.

FIG. 11 is a block diagram illustrating a sound recording apparatus according to an example embodiment.

FIG. 12 is a block diagram illustrating a sound recording apparatus according to an example embodiment.

Detailed Description

Reference will now be made in detail to the exemplary embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, like numbers in different drawings represent the same or similar elements unless otherwise indicated. The implementations described in the exemplary embodiments below are not intended to represent all implementations consistent with the present disclosure. Rather, they are merely examples of apparatus and methods consistent with certain aspects of the present disclosure, as detailed in the appended claims.

In the related technology, when a user needs to record a telephone number during a call, the user records the telephone number by using a notebook or a memo of a mobile phone or records the telephone number by calling a dialing page to input the telephone number under the condition of not hanging up the telephone; however, the related art has the following drawbacks: when the user records the telephone number by adopting the related technology, although the user does not need to hang up the telephone, the user can not be concentrated on and smoothly converse with the other party in the time of recording the telephone number, namely, the conversation process can be interrupted in the prior art, the conversation effect is influenced, and the user experience is poor.

In order to solve the above problem, an embodiment of the present disclosure provides a recording method, where the method includes: detecting whether a key audio signal fragment meeting a preset condition is included in an audio signal or not in the process of recording the audio signal; and marking the key audio signal segments in the recording file when detecting that the key audio signal segments meeting the preset conditions are included in the audio signals. The technical scheme provided by the embodiment of the disclosure can be used for facilitating a user to quickly find the telephone number in the recording file through the mark; the technical scheme relates to a terminal, wherein the terminal is an electronic device with a recording function, such as a smart phone, a recording device, a tablet computer, a wearable device or a vehicle-mounted mobile terminal.

Fig. 1 is a flowchart illustrating a recording method according to an exemplary embodiment, where an execution subject of the method may be a terminal, as shown in fig. 1, the method includes the following steps 101 and 102:

in step 101, during recording of an audio signal, it is detected whether a key audio signal segment meeting a preset condition is included in the audio signal.

For example, the audio signal is current call content of the terminal, or a sound source of an environment in which the terminal is currently located; the audio signal may be divided into at least two audio signal sections based on a time axis; key audio signal segments meeting preset conditions, such as audio signal segments carrying telephone numbers, certificate numbers or time; the preset conditions are, for example: the key audio signal segments include consecutive, numeric utterance segments of a specified numeric length.

After the terminal receives the instruction for starting recording, the terminal starts the recording function; the terminal detects whether a key audio signal fragment meeting a preset condition is included in the audio signal in the process of recording the audio signal and generating a recording file.

In step 102, when it is detected that a key audio signal segment meeting a preset condition is included in the audio signal, the key audio signal segment is marked in the recording file.

For example, if it is detected that the audio signal includes a key audio signal segment meeting a preset condition, the terminal marks the key audio signal segment in the recording file; for example, the terminal adds a mark to the starting position of a key audio signal segment in a recording file; after the recording is finished, the user can quickly find the marked recording in the recording file through the mark. If the key audio signal segment meeting the preset condition is not detected in the audio signal, the process is ended.

According to the technical scheme provided by the embodiment of the disclosure, in the process of recording the audio signal, the key audio signal segment which meets the preset condition is marked in the recording file, and a user can quickly find the recording corresponding to the mark in the recording file through the mark, so that the content concerned by the user is obtained, and the operation is simple and convenient; the marking process is intelligent, does not need user participation, does not influence the conversation effect, and thus, the user experience can be improved.

In one possible implementation, as shown in fig. 2, the step 101 shown in fig. 1 can be implemented as steps 201 and 204:

in step 201, an audio signal is recorded.

In step 202, detecting whether the audio signal comprises a continuous preset number of key audio signal sub-segments; the key audio signal sub-segment is matched with a sound sample in a sound sample set acquired in advance; if the audio signal includes a preset number of consecutive key audio signal sub-segments, go to step 203; if the audio signal does not include a predetermined number of consecutive key audio signal sub-segments, go to step 204.

Illustratively, a key audio signal segment, such as an audio signal segment carrying a telephone number, a certificate number or time, such as a numeric utterance.

For example, the implementation of step 202 may include: the terminal detects whether the audio signal comprises a key audio signal sub-segment matched with a sound sample in a pre-acquired sound sample set, namely, whether a sound sample matched with a current audio signal sub-segment in the audio signal exists in the pre-acquired sound sample set is judged, if yes, the current audio signal sub-segment is determined to belong to the key audio signal sub-segment, and if not, the current audio signal sub-segment is determined not to belong to the key audio signal sub-segment;

when the audio signal is detected to comprise a key audio signal sub-segment matched with a sound sample in a pre-acquired sound sample set, continuously judging whether a next audio signal sub-segment behind the key audio signal sub-segment in the audio signal also belongs to the key audio signal sub-segment;

judging whether continuous key audio signal sub-segments with preset number exist in the audio signal; when the audio signal is judged to have continuous key audio signal sub-segments with the preset number, determining that the audio signal comprises the key audio signal segments meeting the preset conditions; the key audio signal segments comprise the above-mentioned consecutive predetermined number of key audio signal sub-segments. And when the audio signal is judged not to have continuous key audio signal sub-segments with the preset number, determining that the key audio signal segments meeting the preset condition are not included in the audio signal.

In step 203, it is determined that a key audio signal segment meeting a preset condition is included in the audio signal.

Illustratively, the key audio signal segments include a consecutive preset number of key audio signal sub-segments, e.g., a consecutive preset number of numeric utterances.

When the audio signal comprises continuous key audio signal sub-segments with preset number, determining that the key audio signal segments meeting the preset condition are included in the audio signal.

In step 204, it is determined that no key audio signal segment meeting the preset condition is included in the audio signal.

For example, when a predetermined number of consecutive key audio signal sub-segments are not included in the audio signal, it is determined that a key audio signal segment satisfying a predetermined condition is not included in the audio signal.

According to the technical scheme provided by the embodiment of the disclosure, whether the audio signal comprises the key audio signal segments meeting the preset condition is determined by detecting whether the audio signal comprises the continuous key audio signal sub-segments with the preset number, so that the accuracy of determining the key audio signal segments is improved, the key audio signal segments meeting the preset condition are marked in the recording file on the basis, and the accuracy of marking can be improved.

In one possible implementation, step 102 shown in fig. 1 may be implemented as: adding first marking information in time information corresponding to key audio signal segments in a sound recording file; the first marking information is used for marking the time information corresponding to the key audio signal segment on the playing progress bar according to the first marking information when the sound recording file is played.

In this embodiment, by adding the first flag information to the time information corresponding to the key audio signal segment in the recording file, when the recording is played through the terminal or other playing devices after the recording is finished, the terminal or other playing devices identify the time information corresponding to the key audio signal segment on the playing progress bar according to the first flag information in the recording file, for example, flag symbols and other flag symbols are displayed at or near the time corresponding to the key audio signal segment on the playing progress bar; the user can directly position the playing position where the key audio signal clip is played through the mark, and the marked recording can be quickly found.

In one possible implementation, as shown in FIG. 3, the step 102 shown in FIG. 1 can be implemented as steps 301 and 302:

in step 301, key information carried by key audio signal segments is obtained.

Illustratively, the key information includes any one or a combination of: a telephone number; a certificate number; time.

In step 302, key information is recorded into a sound recording file.

According to the method and the device, the key information is recorded in the recording file independently, after the recording is finished, when the recording is played through the terminal or other playing devices, the terminal or other playing devices can directly display the key information in a character form on the screen of the terminal or other playing devices according to the key information recorded in the recording file, a user can obtain the information conveniently, the recording playback process is omitted, and the efficiency is improved.

In a possible implementation manner, as shown in fig. 4, the recording method according to the present disclosure may further include the following steps 401 and 402:

in step 401, time information corresponding to the key audio signal segment in the audio record file is obtained.

In an example, in the process of recording the audio signal, the terminal marks a key audio signal segment meeting a preset condition in the recording file, and acquires time information corresponding to the key audio signal segment in the recording file.

In step 402, the acquired time information is recorded into a sound recording file.

In the embodiment, the key audio signal segments meeting the preset conditions are marked in the recording file, furthermore, the terminal also records the time information corresponding to the key audio signal segments in the recording file separately, the time information is directly displayed on a terminal screen when the recording is played, and a user can accurately find the corresponding recording position according to the time information, so that the user experience is improved.

The implementation is described in detail below by way of several embodiments.

FIG. 5 is a flow diagram illustrating a method of recording sound implemented by a terminal in accordance with an exemplary embodiment; as shown in fig. 5, the method comprises the steps of:

in step 501, an audio signal is recorded.

In step 502, in the process of recording the audio signal, detecting whether the audio signal includes a continuous preset number of key audio signal sub-segments; the key audio signal sub-segment is matched with a sound sample in a sound sample set acquired in advance; if the audio signal includes a predetermined number of consecutive key audio signal sub-segments, go to step 503; if the audio signal does not include a predetermined number of consecutive key audio signal sub-segments, go to step 504.

In step 503, determining that a key audio signal segment meeting a preset condition is included in the audio signal; go to step 505; wherein the key audio signal segment comprises a preset number of consecutive key audio signal sub-segments.

In step 504, determining that the key audio signal segment meeting the preset condition is not included in the audio signal; the flow ends.

In step 505, adding first flag information to the time information corresponding to the key audio signal segment in the audio file; the first marking information is used for marking the time information corresponding to the key audio signal segment on the playing progress bar according to the first marking information when the sound recording file is played.

Illustratively, a terminal acquires key information carried by a key audio signal segment; and recording the key information into a sound recording file.

In step 506, time information corresponding to the key audio signal segment in the audio record file is obtained.

In step 507, the acquired time information is recorded into a sound recording file.

According to the technical scheme provided by the embodiment of the disclosure, in the process of recording the audio signal, the key audio signal fragment meeting the preset condition is marked in the recording file, and a user can quickly find the recording corresponding to the mark in the recording file through the mark, so that the content concerned by the user is obtained, and the operation is simple and convenient; the marking process is intelligent, does not need user participation, does not influence the conversation effect, and thus, the user experience can be improved.

As a possible embodiment, there is provided a scheme for automatically dotting a call recording telephone number, where the scheme is to make a number recognition engine and a set dotting numerical length by pre-obtaining or recording various numeric pronunciations, and when a call recording is started, to dotte a numeric string recording conforming to the numerical length by using the number recognition engine and the set dotting numerical length, so as to facilitate a user to quickly find a recording corresponding to the telephone number through a tag in the call recording. The scheme specifically comprises the following steps:

step a, providing 10-digit pronunciation records such as 0-9 in the terminal in advance to be used as a sound sample;

b, setting the length of the marked digital pronunciation to be N, and recording continuous digital pronunciations of more than or equal to 8 digits if the length of the marked digital pronunciations is N;

c, in the process of call recording, if the fact that the pronunciation is consistent with the recorded digital pronunciation is detected, marking preparation is started, and when the next continuous N pronunciations can respectively search the matched sound samples in the recorded sound samples, a marking point is added before the first pronunciation;

and d, after the recording is finished, the user can quickly find the corresponding recording through the mark points.

The technical scheme provided by the embodiment of the disclosure automatically dots the recording according to the voice sample and the set conversation character length by recording the voice sample, the dotting process is intelligent, the user can quickly find the recording corresponding to the mark in the recording file by the mark, the operation is simple and convenient, and the user experience can be improved.

The following are embodiments of the disclosed apparatus that may be used to perform embodiments of the disclosed methods.

FIG. 6 is a block diagram illustrating a sound recording apparatus in accordance with an exemplary embodiment; the apparatus may be implemented in various ways, for example, with all of the components of the apparatus being implemented in a terminal, or with components of the apparatus being implemented in a coupled manner on the terminal side; the apparatus may implement the method related to the present disclosure through software, hardware, or a combination of the two, as shown in fig. 6, the recording apparatus includes: a detection module 601 and a marking module 602, wherein:

the detection module 601 is configured to detect whether a key audio signal segment meeting a preset condition is included in an audio signal during recording of the audio signal;

the tagging module 602 is configured to tag key audio signal segments in the sound recording file when detecting that the key audio signal segments meet a preset condition are included in the audio signal.

In the recording device provided by the embodiment of the present disclosure, in the process of recording an audio signal by configuring the detection module 601, it is detected whether a key audio signal segment meeting a preset condition is included in the audio signal; the marking module 602 marks key audio signal segments in the recording file when detecting that the audio signal comprises the key audio signal segments meeting preset conditions; therefore, in the process of recording the audio signals, the key audio signal fragments meeting the preset conditions are marked in the recording file, and a user can quickly find the recording corresponding to the marks in the recording file through the marks, so that the content concerned by the user is obtained, and the operation is simple and convenient; the marking process is intelligent, does not need user participation, does not influence the conversation effect, and thus, the user experience can be improved.

In a possible implementation, as shown in fig. 7, the sound recording apparatus shown in fig. 6 may further include a detection module 601 configured to include: a first detection sub-module 701, a first determination sub-module 702, and a second determination sub-module 703, wherein:

the first detection submodule 701 is configured to detect whether a consecutive, preset number of key audio signal sub-segments are included in the audio signal; the key audio signal sub-segment is matched with a sound sample in a sound sample set acquired in advance;

the first determining submodule 702 is configured to determine that a key audio signal segment meeting a preset condition is included in the audio signal when a consecutive, preset number of key audio signal sub-segments are included in the audio signal; the key audio signal segments comprise a preset number of consecutive key audio signal sub-segments;

the second determining sub-module 703 is configured to determine that a key audio signal segment complying with a preset condition is not included in the audio signal when a consecutive, preset number of key audio signal sub-segments is not included in the audio signal.

In one possible implementation, the tagging module 602 adds first tagging information to time information corresponding to key audio signal segments in the audio record file; the first marking information is used for marking the time information corresponding to the key audio signal segment on the playing progress bar according to the first marking information when the sound recording file is played.

In one possible embodiment, as shown in fig. 8, the sound recording apparatus shown in fig. 6 may further include a marking module 602 configured to include: an acquisition sub-module 801 and a recording sub-module 802, wherein:

the obtaining sub-module 801 is configured to obtain key information carried by the key audio signal segment;

the recording sub-module 802 is configured to record the key information into the sound recording file.

In a possible implementation manner, as shown in fig. 9, the sound recording apparatus shown in fig. 6 may further include: an obtaining module 901 and a recording module 902, wherein:

the obtaining module 901 is configured to obtain time information corresponding to the key audio signal segment in the sound recording file;

the recording module 902 is configured to record the acquired time information into a sound recording file.

Fig. 10 is a block diagram illustrating an audio recording apparatus 1000 according to an exemplary embodiment, where the audio recording apparatus 1000 may be implemented in various manners, such as implementing all components of the apparatus in a terminal or implementing the components of the apparatus in a coupled manner on a terminal side; the recording apparatus 1000 includes:

a processor 1001;

a memory 1002 for storing processor-executable instructions;

wherein the processor 1001 is configured to: detecting whether a key audio signal fragment meeting a preset condition is included in an audio signal or not in the process of recording the audio signal; and marking the key audio signal segments in the recording file when detecting that the key audio signal segments meeting the preset conditions are included in the audio signals.

In one embodiment, the processor 1001 may be further configured to: detecting whether the audio signal comprises continuous key audio signal sub-segments with preset number or not; the key audio signal sub-segment is matched with a sound sample in a sound sample set acquired in advance;

when the audio signal comprises continuous key audio signal sub-segments with preset number, determining that the audio signal comprises key audio signal segments meeting preset conditions; the key audio signal segments comprise a preset number of consecutive key audio signal sub-segments; or,

when the audio signal does not include a continuous preset number of key audio signal sub-segments, determining that the key audio signal segments meeting the preset condition are not included in the audio signal.

In one embodiment, the processor 1001 may be further configured to: adding first marking information in time information corresponding to key audio signal segments in a sound recording file; the first marking information is used for marking the time information corresponding to the key audio signal segment on the playing progress bar according to the first marking information when the sound recording file is played.

In one embodiment, the processor 1001 may be further configured to: acquiring key information carried by a key audio signal fragment; and recording the key information into a sound recording file.

In one embodiment, the processor 1001 may be further configured to: acquiring time information corresponding to the key audio signal fragment in the sound recording file; and recording the acquired time information into a sound recording file.

According to the recording device provided by the embodiment of the disclosure, in the process of recording the audio signal, the key audio signal segment which meets the preset condition is marked in the recording file, and the user can quickly find the recording corresponding to the mark in the recording file through the mark, so that the content which the user cares about is obtained, and the operation is simple and convenient; the marking process is intelligent, does not need user participation, does not influence the conversation effect, and thus, the user experience can be improved.

With regard to the apparatus in the above-described embodiment, the specific manner in which each module performs the operation has been described in detail in the embodiment related to the method, and will not be elaborated here.

FIG. 11 is a block diagram illustrating a sound recording apparatus in accordance with an exemplary embodiment; the recording apparatus 1100 is applicable to a terminal; the sound recording apparatus 1100 may include one or more of the following components: processing component 1102, memory 1104, power component 1106, multimedia component 1108, audio component 1110, input/output (I/O) interface 1112, sensor component 1114, and communications component 1116.

The processing component 1102 generally controls the overall operation of the sound recording apparatus 1100, such as operations associated with display, telephone calls, data communications, camera operations, and recording operations. The processing components 1102 may include one or more processors 1120 to execute instructions to perform all or a portion of the steps of the methods described above. Further, the processing component 1102 may include one or more modules that facilitate interaction between the processing component 1102 and other components. For example, the processing component 1102 may include a multimedia module to facilitate interaction between the multimedia component 1108 and the processing component 1102.

The memory 1104 is configured to store various types of data to support the operation at the sound recording apparatus 1100. Examples of such data include instructions for any application or method operating on the sound recording apparatus 1100, contact data, phonebook data, messages, pictures, videos, and the like. The memory 1104 may be implemented by any type or combination of volatile or non-volatile memory devices such as Static Random Access Memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic or optical disks.

The power supply component 1106 provides power to the various components of the recording device 1100. The power components 1106 may include a power management system, one or more power supplies, and other components associated with generating, managing, and distributing power for the sound recording device 1100.

The multimedia component 1108 includes a screen that provides an output interface between the recording apparatus 1100 and the user. In some embodiments, the screen may include a Liquid Crystal Display (LCD) and a Touch Panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive an input signal from a user. The touch panel includes one or more touch sensors to sense touch, slide, and gestures on the touch panel. The touch sensor may not only sense the boundary of a touch or slide action, but also detect the duration and pressure associated with the touch or slide operation. In some embodiments, the multimedia component 1108 includes a front facing camera and/or a rear facing camera. The front camera and/or the rear camera may receive external multimedia data when the recording apparatus 1100 is in an operation mode, such as a photographing mode or a video mode. Each front camera and rear camera may be a fixed optical lens system or have a focal length and optical zoom capability.

The audio component 1110 is configured to output and/or input audio signals. For example, the audio component 1110 includes a Microphone (MIC) configured to receive external audio signals when the recording apparatus 1100 is in an operating mode, such as a call mode, a recording mode, and a voice recognition mode. The received audio signals may further be stored in the memory 1104 or transmitted via the communication component 1116. In some embodiments, the audio assembly 1110 further includes a speaker for outputting audio signals.

The I/O interface 1112 provides an interface between the processing component 1102 and peripheral interface modules, which may be keyboards, click wheels, buttons, etc. These buttons may include, but are not limited to: a home button, a volume button, a start button, and a lock button.

The sensor assembly 1114 includes one or more sensors for providing various aspects of status assessment for the recording apparatus 1100. For example, the sensor assembly 1114 may detect the open/closed status of the recording apparatus 1100, the relative positioning of the components, such as the display and keypad of the recording apparatus 1100, the sensor assembly 1114 may also detect a change in the position of the recording apparatus 1100 or a component of the recording apparatus 1100, the presence or absence of user contact with the recording apparatus 1100, the orientation or acceleration/deceleration of the recording apparatus 1100, and a change in the temperature of the recording apparatus 1100. The sensor assembly 1114 may include a proximity sensor configured to detect the presence of a nearby object without any physical contact. The sensor assembly 1114 may also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications. In some embodiments, the sensor assembly 1114 may also include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.

The communication component 1116 is configured to facilitate wired or wireless communication between the sound recording apparatus 1100 and other devices. The sound recording device 1100 may access a wireless network based on a communication standard, such as WiFi, 2G or 3G, or a combination thereof. In an exemplary embodiment, the communication component 1116 receives broadcast signals or broadcast related information from an external broadcast management system via a broadcast channel. In an exemplary embodiment, the communication component 1116 also includes a Near Field Communication (NFC) module to facilitate short-range communications. For example, the NFC module may be implemented based on Radio Frequency Identification (RFID) technology, infrared data association (IrDA) technology, Ultra Wideband (UWB) technology, Bluetooth (BT) technology, and other technologies.

In an exemplary embodiment, the sound recording apparatus 1100 may be implemented by one or more Application Specific Integrated Circuits (ASICs), Digital Signal Processors (DSPs), Digital Signal Processing Devices (DSPDs), Programmable Logic Devices (PLDs), Field Programmable Gate Arrays (FPGAs), controllers, micro-controllers, microprocessors or other electronic components for performing the above-described methods.

In an exemplary embodiment, a non-transitory computer-readable storage medium comprising instructions, such as the memory 1104 comprising instructions, executable by the processor 1120 of the audio recording device 1100 to perform the method described above is also provided. For example, the non-transitory computer readable storage medium may be a ROM, a Random Access Memory (RAM), a CD-ROM, a magnetic tape, a floppy disk, an optical data storage device, and the like.

A non-transitory computer-readable storage medium, instructions in which, when executed by a processor of an audio recording apparatus 1100, enable the audio recording apparatus 1100 to perform an audio recording method comprising: detecting whether a key audio signal fragment meeting a preset condition is included in an audio signal or not in the process of recording the audio signal; and marking the key audio signal segments in the recording file when detecting that the key audio signal segments meeting the preset conditions are included in the audio signals.

In one embodiment, detecting whether a key audio signal segment meeting a preset condition is included in an audio signal comprises:

In one embodiment, tagging key audio signal segments in a sound recording file comprises: adding first marking information in time information corresponding to key audio signal segments in a sound recording file; the first marking information is used for marking the time information corresponding to the key audio signal segment on the playing progress bar according to the first marking information when the sound recording file is played.

In one embodiment, tagging key audio signal segments in a sound recording file comprises: acquiring key information carried by a key audio signal fragment; and recording the key information into a sound recording file.

In one embodiment, the method further comprises: acquiring time information corresponding to the key audio signal fragment in the sound recording file; and recording the acquired time information into a sound recording file.

Fig. 12 is a block diagram illustrating a sound recording apparatus according to an exemplary embodiment. For example, the recording apparatus 1200 may be provided as a server. The sound recording apparatus 1200 includes a processing component 1202 that further includes one or more processors, and memory resources, represented by memory 1203, for storing instructions, such as application programs, that are executable by the processing component 1202. The application programs stored in the memory 1203 may include one or more modules that each correspond to a set of instructions. Further, the processing component 1202 is configured to execute instructions to perform the above-described methods.

The sound recording apparatus 1200 may further include a power supply component 1206 configured to perform power management of the sound recording apparatus 1200, a wired or wireless network interface 1205 configured to connect the sound recording apparatus 1200 to a network, and an input-output (I/O) interface 1208. The recording unit 1200 may operate based on an operating system stored in the memory 1203, such as Windows Server, Mac OS XTM, UnixTM, LinuxTM, FreeBSDTM, or the like.

Other embodiments of the disclosure will be apparent to those skilled in the art from consideration of the specification and practice of the disclosure disclosed herein. This application is intended to cover any variations, uses, or adaptations of the disclosure following, in general, the principles of the disclosure and including such departures from the present disclosure as come within known or customary practice within the art to which the disclosure pertains. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the disclosure being indicated by the following claims.

It will be understood that the present disclosure is not limited to the precise arrangements described above and shown in the drawings and that various modifications and changes may be made without departing from the scope thereof. The scope of the present disclosure is limited only by the appended claims.

Claims

1. A method of recording a sound, comprising:

2. The method according to claim 1, wherein the detecting whether the key audio signal segment meeting the preset condition is included in the audio signal comprises:

when the audio signal comprises a continuous preset number of key audio signal sub-segments, determining that the audio signal comprises key audio signal segments meeting preset conditions; the key audio signal segments comprise the preset number of consecutive key audio signal sub-segments;

or,

3. The method of claim 1, wherein said tagging said key audio signal segments in a sound recording file comprises:

4. The method of claim 1, wherein said tagging said key audio signal segments in a sound recording file comprises:

acquiring key information carried by the key audio signal fragment;

and recording the key information into a sound recording file.

5. The method of claim 1, further comprising:

and recording the acquired time information into the sound recording file.

6. A sound recording apparatus, comprising:

7. The apparatus of claim 6, wherein the detection module comprises:

8. The apparatus of claim 6, wherein the tagging module adds first tagging information to time information corresponding to the key audio signal segment in a sound recording file; the first marking information is used for marking the time information corresponding to the key audio signal segment on a playing progress bar according to the first marking information when the sound recording file is played.

9. The apparatus of claim 6, wherein the marking module comprises:

10. The apparatus of claim 6, further comprising:

11. A sound recording apparatus, comprising:

a processor;

a memory for storing processor-executable instructions;

wherein the processor is configured to: