CN107886969A

CN107886969A - A kind of audio frequency playing method and audio playing apparatus

Info

Publication number: CN107886969A
Application number: CN201711159491.4A
Authority: CN
Inventors: 石奇岭
Original assignee: Vivo Mobile Communication Co Ltd
Current assignee: Vivo Mobile Communication Co Ltd
Priority date: 2017-11-20
Filing date: 2017-11-20
Publication date: 2018-04-06
Anticipated expiration: 2037-11-20
Also published as: CN107886969B

Abstract

The present invention, which provides a kind of audio frequency playing method and audio playing apparatus, wherein method, to be included：Target audio is parsed, and the audio section for determining not include voice in the target audio is target sound frequency range；The target sound frequency range is skipped when playing the target audio.The embodiment of the present invention can identify the audio section for not including voice in target audio, and the audio section for not including voice is skipped when playing the target audio, can effectively improve audio playing efficiency, provide the user with conveniently.

Description

A kind of audio frequency playing method and audio playing apparatus

Technical field

The present invention relates to multimedia technology field, more particularly to a kind of audio frequency playing method and audio playing apparatus.

Background technology

One of major way recorded as people's record information, it can go back prime information exactly, be provided very to people Big convenience, such as minutes speed when not catching up with meeting process during meeting, recorded when can be in session, and after the meeting Listen to recording file and further improve minutes.Under many circumstances, there is discontinuity in the content of recording, such as record words It can cause to pause because of thinking or other reasonses during language, so as to produce the audio section for not including voice, these do not include voice Audio section can also take reproduction time in recording broadcasting, but not have practical significance, waste user time；Although user can adjust Whole playing progress bar, but due to the play position that user does not know the audio section for not including voice, easily miss useful information.

It can be seen that in the prior art, audio, which plays, can not precisely skip the audio section for not including voice, broadcasting effect is reduced Rate, made troubles to user.

The content of the invention

The embodiment of the present invention provides a kind of audio frequency playing method and audio playing apparatus, is broadcast with solving prior art sound intermediate frequency The audio section for not including voice can not precisely be skipped by putting, and reduce playing efficiency, the problem of being made troubles to user.

In order to solve the above-mentioned technical problem, the present invention is realized in：A kind of audio frequency playing method, broadcasts applied to audio Device is put, including：

Target audio is parsed, and the audio section for determining not include voice in the target audio is target sound frequency range；

The target sound frequency range is skipped when playing the target audio.

In a first aspect, the embodiments of the invention provide a kind of audio frequency playing method, applied to audio playing apparatus, including：

The target sound frequency range is skipped when playing the target audio.

Second aspect, the embodiments of the invention provide a kind of audio playing apparatus, including：

Determining module, for parsing target audio, and the audio section for determining not include voice in the target audio is mesh Mark with phonetic symbols frequency range；

Control module, for skipping the target sound frequency range when playing the target audio.

The third aspect, the embodiments of the invention provide another audio playing apparatus, including processor, memory, storage On the memory and the computer program that can run on the processor, the computer program are held by the processor The step of above-mentioned audio frequency playing method is realized during row.

Fourth aspect, the embodiments of the invention provide a kind of computer-readable recording medium, the computer-readable storage Computer program is stored with medium, the computer program realizes the step of above-mentioned audio frequency playing method when being executed by processor Suddenly.

So, the embodiment of the present invention can identify the audio section for not including voice in target audio, and play the mesh Being skipped during mark with phonetic symbols frequency does not include the audio section of voice, can effectively improve audio playing efficiency, provides the user with conveniently.

Brief description of the drawings

In order to illustrate the technical solution of the embodiments of the present invention more clearly, needed for being described below to the embodiment of the present invention The accompanying drawing to be used is briefly described, it should be apparent that, drawings in the following description are only some embodiments of the present invention, For those of ordinary skill in the art, without having to pay creative labor, can also be obtained according to these accompanying drawings Take other accompanying drawings.

Fig. 1 is a kind of flow chart of audio frequency playing method provided in an embodiment of the present invention；

Fig. 2 is a kind of audio playing progress bar schematic diagram provided in an embodiment of the present invention；

Fig. 3 is the flow chart of another audio frequency playing method provided in an embodiment of the present invention；

Fig. 4 is a kind of sound wave spectrum schematic diagram of audio provided in an embodiment of the present invention；

Fig. 5 is the flow chart of another audio frequency playing method provided in an embodiment of the present invention；

Fig. 6 is a kind of structure chart of audio playing apparatus provided in an embodiment of the present invention；

Fig. 7 is the structure chart of another audio playing apparatus provided in an embodiment of the present invention；

Fig. 8 is the structure chart of another audio playing apparatus provided in an embodiment of the present invention；

Fig. 9 is the structure chart of another audio playing apparatus provided in an embodiment of the present invention；

Figure 10 is the structure chart of another audio playing apparatus provided in an embodiment of the present invention；

Figure 11 is the structure chart of another audio playing apparatus provided in an embodiment of the present invention.

Embodiment

Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, complete Site preparation describes, it is clear that described embodiment is part of the embodiment of the present invention, rather than whole embodiments.Based on this hair Embodiment in bright, those of ordinary skill in the art's every other implementation acquired under the premise of creative work is not made Example, belongs to the scope of protection of the invention.

Referring to Fig. 1, Fig. 1 is a kind of flow chart of audio frequency playing method provided in an embodiment of the present invention, and the phonetic aspect of a dialect frequency is broadcast Put method and be applied to an audio playing apparatus, as shown in figure 1, comprising the following steps：

Step 101, parsing target audio, and the audio section for determining not include voice in the target audio is target audio Section.

In the step, methods described parsing target audio, and determine the audio section for not including voice in the target audio For target sound frequency range.The target audio can be audio or audio frequency and video.Methods described parses the target audio can To be the sound wave spectrum for parsing the target audio, and the sound for not including voice is identified according to the sound wave spectrum of the target audio Frequency range is target sound frequency range.The audio section for not including voice can be blank audio section or only include removing voice Outside other sound audio section.

Methods described identifies that the mode for the audio section for not including voice in the target audio can be specifically according to The sound wave spectrum of target audio identifies the blank audio section in the target audio, it is then determined that the blank in the target audio Audio section is target sound frequency range.Methods described can also carry out voiceprint analysis to the target audio, using vocal print technology of acoustic wave The sound wave matched in the target audio with voice vocal print feature is identified, it is then determined that not including in the target audio and voice The audio section of the sound wave of vocal print feature matching is target sound frequency range.

Step 102, skip the target sound frequency range when playing the target audio.

In the step, methods described skips the target sound frequency range when playing the target audio.For example, when with When family plays session recording, if thering are some there is no speech content audio section, such as meeting time of having a rest or participant in recording The recording of the think time of member, the section is not talked during methods described can directly skip session recording when playing session recording The time of content, user can directly listen to coherent meeting speech content, can effectively save the time of user, improve audio Playing efficiency.

It is understood that in some embodiments of the invention, methods described can not also directly skip the section and not talk The time of content, and the position of the target sound frequency range is marked in the playing progress bar of the session recording, such as such as Fig. 2 institutes Show, one group of bracket, one section of target sound frequency range of mark in the playing progress bar of target audio can be used, so, user can be with Target sound frequency range is accurately skipped according to the position of the target sound frequency range of mark, audio playing efficiency is improved, provides the user with It is convenient.

Alternatively, it is described to skip the target sound frequency range when playing the target audio, including：

The target sound frequency range is skipped when playing the target audio, and in the playing progress bar of the target audio Mark the position of the target sound frequency range.

In the embodiment, methods described skips the target sound frequency range when playing the target audio, and in the mesh The position of the target sound frequency range is marked in the playing progress bar of mark with phonetic symbols frequency.So, can not only skip automatically does not include voice Target sound frequency range, moreover it is possible to allow user understand target sound frequency range particular location, provide the user with conveniently.

In the embodiment of the present invention, above-mentioned audio playing apparatus can be any electronic installation with audio playing function, Such as：Mobile phone, tablet personal computer (Tablet Personal Computer), laptop computer (Laptop Computer), individual Digital assistants (personal digital assistant, abbreviation PDA), mobile Internet access device (Mobile Internet Device, MID) or wearable device (Wearable Device) etc..

In the present embodiment, methods described can identify the audio section for not including voice in target audio, and described in broadcasting Being skipped during target audio does not include the audio section of voice, can effectively improve audio playing efficiency, provides the user with conveniently.

Referring to Fig. 3, Fig. 3 is the flow chart of another audio frequency playing method provided in an embodiment of the present invention, and methods described should For an audio playing apparatus, as shown in figure 3, comprising the following steps：

Step 301, the sound wave spectrum for parsing target audio.

In the step, methods described obtains the sound wave spectrum of the target audio, and parses the sound wave of the target audio Frequency spectrum.

Step 302, the blank audio section in the sound wave spectrum of the target audio identification target audio, and really Blank audio section in the fixed target audio is target sound frequency range.

In the step, methods described identifies that the blank audio section in the target audio is target sound frequency range, for example, When the sound wave spectrum of the target audio is frequency spectrum as shown in Figure 4, methods described can identify that the clear band in frequency spectrum is Target sound frequency range, such as such as the target sound frequency range marked in Fig. 4.

Methods described can identify that the audio section that waveform range value is zero in the target audio is blank audio section, due to In view of absolutely quiet environment may be not present during recording, methods described can also identify wave-shape amplitude in the target audio Value is blank audio section less than the audio section of predetermined threshold value.Methods described can identify whole blank sounds in the target audio Frequency range is target sound frequency range, can also identify that the blank audio section that blank duration exceedes preset duration in the target audio is mesh Mark with phonetic symbols frequency range, as shown in Figure 4.

Step 303, skip the target sound frequency range when playing the target audio.

The step 303 is identical with the step 102 in the embodiment shown in Fig. 1 of the present invention, and here is omitted.

In the present embodiment, methods described identifies that the blank audio section in the target audio is target sound frequency range, and is broadcasting The target sound frequency range is skipped when putting the target audio, audio playing efficiency can be effectively improved, is provided the user with conveniently.

Alternatively, the blank audio section determined in the target audio is target sound frequency range, including：

Determine that the blank audio section that blank duration exceedes preset duration in the target audio is target sound frequency range.

In the embodiment, methods described not can determine whether that all blank audio sections in the target audio are target audio Section, and it is to determine that the blank audio section that blank duration exceedes preset duration in the target audio is target sound frequency range.So, no It can skip and normally be paused during being talked in audio, keep the rhythm talked in target audio.

Alternatively, the blank audio section is the audio section that wave-shape amplitude value is less than predetermined threshold value.

In the embodiment, the blank audio section is the audio section that wave-shape amplitude value is less than predetermined threshold value, due to consideration that Absolutely quiet environment may be not present during recording, methods described can be with allowable error, you can to determine waveform in target audio The audio section that range value is less than predetermined threshold value is blank audio section.

Referring to Fig. 5, Fig. 5 is the flow chart of another audio frequency playing method provided in an embodiment of the present invention, and methods described should For an audio playing apparatus, as shown in figure 5, comprising the following steps：

Step 501, voiceprint analysis are carried out to target audio.

In the step, methods described carries out voiceprint analysis to target audio, i.e., the sound wave spectrum of the target audio is entered Row analysis, obtains the vocal print feature of the target audio.

Step 502, the sound wave matched in the target audio with voice vocal print feature is identified using sound groove recognition technology in e.

In the step, methods described is identified in the target audio using sound groove recognition technology in e and matched with voice vocal print feature Sound wave.The audio playing apparatus can be prestored voice vocal print feature, and methods described will can be carried out to the target audio The vocal print feature obtained after voiceprint analysis is matched with the voice vocal print feature to prestore, and identify in the target audio with people The sound wave of several line characteristic matchings.It is understood that because the pronunciation of the mankind is produced by vocal organs, it, which has, is different from The stability features of sound caused by object (i.e. thing sound), methods described can be according to the voice vocal print feature identifications to prestore Voice sound wave and the several ripples of thing in target audio.

Step 503, the audio section for the sound wave for determining not include matching with voice vocal print feature in the target audio are mesh Mark with phonetic symbols frequency range.

In the step, methods described is determined in the target audio not including the sound of the sound wave matched with voice vocal print feature Frequency range is target sound frequency range, i.e., the audio section that methods described determines not include voice sound wave in the target audio is target audio Section.

Step 504, skip the target sound frequency range when playing the target audio.

The step 504 is identical with the step 102 in the embodiment shown in Fig. 1 of the present invention, and here is omitted.

In the present embodiment, methods described carries out voiceprint analysis to target audio, and the mesh is identified using sound groove recognition technology in e The sound wave matched in mark with phonetic symbols frequency with voice vocal print feature, and determine not include matching with voice vocal print feature in the target audio The audio section of sound wave be target sound frequency range, skip the target sound frequency range when playing the target audio, can effectively carry High audio playing efficiency, provide the user with conveniently.

Alternatively, the parsing target audio, and the audio section for determining not include voice in the target audio is target After the step of audio section, methods described also includes：

Identify in the target audio with the unmatched sound wave of voice vocal print feature；

Remove in the target audio with the unmatched sound wave of voice vocal print feature.

In the step, methods described is also identified in the target audio with the unmatched sound wave of voice vocal print feature, and is moved Except in the target and audio with the unmatched sound wave of voice vocal print feature.For example, when people talks under noisy environment, record Sound content can include voice and environmental noise, because environmental noise disturbs user may be caused to hear during recording broadcasting Voice in recording, leads to miss important information.In the embodiment, methods described can be identified and removed in the target audio With the unmatched sound wave of voice vocal print feature, you can with and remove the noise sound wave identified in the target audio, so as to Only retain the sound wave matched with voice vocal print feature in the target audio.For example, work as same in the 3rd to 5 second in an audio When include sound wave A and sound wave B, wherein, sound wave A is the sound wave that match with voice vocal print feature, and sound wave B is and voice vocal print spy Unmatched sound wave is levied, methods described can remove the sound wave B in the 3rd to 5 second, retain sound wave A.In such manner, it is possible to effectively remove Noise in target audio, avoid noise from bringing interference to user, prevent user's miss critical information.

Alternatively, methods described also includes：

Identify the sound wave matched in the target audio with default vocal print feature；

Play the audio section of sound wave that the target audio includes matching with default vocal print feature.

In the embodiment, methods described identifies the sound wave matched in the target audio with default vocal print feature, and plays The target audio includes the audio section of the sound wave matched with default vocal print feature.It is understood that the production of human language Life be speech center vocal organs between a complicated physiology physical process, each acoustical generator that everybody uses in speech Official's (such as tongue, tooth, larynx, lung, nasal cavity) has differences in terms of size and form, therefore, the vocal print point of different people Do not take on a different character.

For example, when user only needs to listen to the speech content of meeting speaker in session recording, methods described can To identify the sound wave matched in session recording with speaker's vocal print feature, and only broadcasting session recording includes and speaker's vocal print The audio section of the sound wave of characteristic matching, so, user can need not only listen to the speech content of speaker in a meeting, can Effectively save the time of user.

Referring to Fig. 6, Fig. 6 is a kind of structure chart of audio playing apparatus provided in an embodiment of the present invention, as shown in fig. 6, sound Frequency playing device 600 includes：

Determining module 601, for parsing target audio, and the audio section for determining not include voice in the target audio is Target sound frequency range；

Control module 602, for skipping the target sound frequency range when playing the target audio, or in the target The position of the target sound frequency range is marked in the playing progress bar of audio.

Alternatively, the control module 602, is specifically used for：

Alternatively, referring to Fig. 7, Fig. 7 is the structure chart of another audio playing apparatus provided in an embodiment of the present invention, is such as schemed Shown in 7, the determining module 601, including：

Resolution unit 6011, for parsing the sound wave spectrum of target audio；

First determining unit 6012, for identifying the sky in the target audio according to the sound wave spectrum of the target audio White tone frequency range, and determine that the blank audio section in the target audio is target sound frequency range.

Alternatively, first determining unit 6012, is specifically used for：

Alternatively, referring to Fig. 8, Fig. 8 is the structure chart of another audio playing apparatus provided in an embodiment of the present invention, is such as schemed Shown in 8, the determining module 601, including：

Analytic unit 6013, for carrying out voiceprint analysis to target audio；

Recognition unit 6014, matched for being identified using sound groove recognition technology in e in the target audio with voice vocal print feature Sound wave；

Second determining unit 6015, for determining not include the sound wave matched with voice vocal print feature in the target audio Audio section be target sound frequency range.

Alternatively, referring to Fig. 9, Fig. 9 is the structure chart of another audio playing apparatus provided in an embodiment of the present invention, is such as schemed Shown in 9, the audio playing apparatus 600 also includes：

First identification module 603, for identify in the target audio with the unmatched sound wave of voice vocal print feature；

Remove module 604, for remove in the target audio with the unmatched sound wave of voice vocal print feature.

Alternatively, referring to Figure 10, Figure 10 is the structure chart of another audio playing apparatus provided in an embodiment of the present invention, such as Shown in Figure 10, the audio playing apparatus 600 also includes：

Second identification module 605, for identifying the sound wave matched in the target audio with default vocal print feature；

Playing module 606, for the audio for the sound wave for playing the target audio to include to match with default vocal print feature Section.

Audio playing apparatus 600 provided in an embodiment of the present invention can realize that Fig. 1 to Fig. 5 embodiment of the method sound intermediate frequency is broadcast Each process of device realization is put, to avoid repeating, is repeated no more here.

In the embodiment of the present invention, the audio playing apparatus can identify the audio section for not including voice in target audio, And the audio section for not including voice is skipped when playing the target audio, or in the playing progress bar of the target audio Mark does not include the audio section of voice, facilitates user precisely to skip the audio section for not including voice, Neng Gouyou by the mark Effect improves audio playing efficiency, provides the user with conveniently.

Figure 11 is a kind of hardware architecture diagram for the audio playing apparatus for realizing each embodiment of the present invention, and the audio is broadcast Device 1100 is put to include but is not limited to：Radio frequency unit 1101, mixed-media network modules mixed-media 1102, audio output unit 1103, input block 1104th, sensor 1105, display unit 1106, user input unit 1007, interface unit 1108, memory 1109, processor The part such as 1110 and power supply 1111.It will be understood by those skilled in the art that the audio playing apparatus structure shown in Figure 11 is simultaneously The restriction to audio playing apparatus is not formed, and audio playing apparatus can be included than illustrating more or less parts, or group Close some parts, or different parts arrangement.In embodiments of the present invention, audio playing apparatus include but is not limited to mobile phone, Tablet personal computer, notebook computer, palm PC, car-mounted terminal, wearable device and pedometer etc..

Wherein, processor 1110, it is used for：

The target sound frequency range, or the playing progress bar in the target audio are skipped when playing the target audio The position of the upper mark target sound frequency range.

Alternatively, what processor 1110 performed skips the target sound frequency range when playing the target audio, including：

Alternatively, the parsing target audio that processor 1110 performs, and determine not include voice in the target audio Audio section is target sound frequency range, including：

Parse the sound wave spectrum of target audio；

Blank audio section in the target audio is identified according to the sound wave spectrum of the target audio, and determines the mesh Blank audio section in mark with phonetic symbols frequency is target sound frequency range.

Alternatively, the blank audio section in target audio described in the determination that the processor 1110 performs is target audio Section, including：

Alternatively, the parsing target audio that the processor 1110 performs, and determine not include people in the target audio The audio section of sound is target sound frequency range, including：

Voiceprint analysis are carried out to target audio；

The sound wave matched in the target audio with voice vocal print feature is identified using sound groove recognition technology in e；

The audio section for the sound wave for determining not include matching with voice vocal print feature in the target audio is target sound frequency range.

Alternatively, the processor 1110 performs parsing target audio, and determines not include voice in the target audio Audio section the step of being target sound frequency range after, can also realize following steps：

Alternatively, the processor 1110 can also realize following steps：

In the embodiment of the present invention, audio playing apparatus can identify the audio section for not including voice in target audio, and Being skipped when playing the target audio does not include the audio section of voice, can effectively improve audio playing efficiency, provide the user with It is convenient.

It should be understood that in the embodiment of the present invention, radio frequency unit 1101 can be used for receiving and sending messages or communication process in, signal Reception and transmission, specifically, by from base station downlink data receive after, handled to processor 1110；In addition, will be up Data are sent to base station.Generally, radio frequency unit 1101 includes but is not limited to antenna, at least one amplifier, transceiver, coupling Device, low-noise amplifier, duplexer etc..In addition, radio frequency unit 1101 can also pass through wireless communication system and network and other Equipment communication.

Audio playing apparatus has provided the user wireless broadband internet by mixed-media network modules mixed-media 1102 and accessed, as helped to use Send and receive e-mail, browse webpage and access streaming video etc. in family.

Audio output unit 1103 can be receiving by radio frequency unit 1101 or mixed-media network modules mixed-media 1102 or in memory It is sound that the voice data stored in 1109, which is converted into audio signal and exported,.Moreover, audio output unit 1103 can be with The audio output related to the specific function that audio playing apparatus 1100 performs is provided (for example, call signal receives sound, message Receive sound etc.).Audio output unit 1103 includes loudspeaker, buzzer and receiver etc..

Input block 1104 is used to receive audio or video signal.Input block 1104 can include graphics processor (Graphics Processing Unit, GPU) 11041 and microphone 11042, graphics processor 11041 in video to capturing The static images or the view data of video obtained in pattern or image capture mode by image capture apparatus (such as camera) enter Row processing.Picture frame after processing may be displayed on display unit 1106.Picture frame after the processing of graphics processor 11041 It can be stored in memory 1109 (or other storage mediums) or be carried out via radio frequency unit 1101 or mixed-media network modules mixed-media 1102 Send.Microphone 11042 can receive sound, and can be voice data by such acoustic processing.Audio after processing Data can be converted to the lattice that mobile communication base station can be sent to via radio frequency unit 1101 in the case of telephone calling model Formula exports.

Audio playing apparatus 1100 also includes at least one sensor 1105, for example, optical sensor, motion sensor and Other sensors.Specifically, optical sensor includes ambient light sensor and proximity transducer, wherein, ambient light sensor can root The brightness of display panel 11061 is adjusted according to the light and shade of ambient light, proximity transducer can move in audio playing apparatus 1100 When arriving in one's ear, display panel 11061 and/or backlight are closed.As one kind of motion sensor, accelerometer sensor is detectable each The size of (generally three axles) acceleration, can detect that size and the direction of gravity on individual direction when static, available for identifying sound Frequency playing device posture (such as horizontal/vertical screen switching, dependent game, magnetometer pose calibrating), Vibration identification correlation function (such as Pedometer, percussion) etc.；Sensor 1105 can also include fingerprint sensor, pressure sensor, iris sensor, molecule sensing Device, gyroscope, barometer, hygrometer, thermometer, infrared ray sensor etc., will not be repeated here.

Display unit 1106 is used for the information for showing the information inputted by user or being supplied to user.Display unit 1106 can Including display panel 11061, liquid crystal display (Liquid Crystal Display, LCD), organic light-emitting diodes can be used Forms such as (Organic Light-Emitting Diode, OLED) is managed to configure display panel 11061.

User input unit 1107 can be used for the numeral or character information for receiving input, and generation and audio playing apparatus User set and function control it is relevant key signals input.Specifically, user input unit 1107 includes contact panel 11071 and other input equipments 11072.Contact panel 11071, also referred to as touch-screen, user is collected on or near it Touch operation (for example user on contact panel 11071 or is being touched using any suitable object or annex such as finger, stylus Control the operation near panel 11071).Contact panel 11071 may include both touch detecting apparatus and touch controller.Its In, touch detecting apparatus detects the touch orientation of user, and detects the signal that touch operation is brought, and transmits a signal to touch control Device processed；Touch controller receives touch information from touch detecting apparatus, and is converted into contact coordinate, then gives processor 1110, order that reception processing device 1110 is sent simultaneously is performed.Furthermore, it is possible to using resistance-type, condenser type, infrared ray and The polytypes such as surface acoustic wave realize contact panel 11071.Except contact panel 11071, user input unit 1107 can be with Including other input equipments 11072.Specifically, other input equipments 11072 can include but is not limited to physical keyboard, function key (such as volume control button, switch key etc.), trace ball, mouse, action bars, will not be repeated here.

Further, contact panel 11071 can be covered on display panel 11061, when contact panel 11071 detects After touch operation on or near it, processor 1110 is sent to determine the type of touch event, is followed by subsequent processing device 1110 Corresponding visual output is provided on display panel 11061 according to the type of touch event.Although in fig. 11, contact panel 11071 and display panel 11061 are the parts independent as two to realize the input of audio playing apparatus and output function, but It is in some embodiments it is possible to which contact panel 11071 is integrated with display panel 11061 and realizes the defeated of audio playing apparatus Enter and output function, do not limit herein specifically.

Interface unit 1108 is the interface that external device (ED) is connected with audio playing apparatus 1100.For example, external device (ED) can be with Including wired or wireless head-band earphone port, external power source (or battery charger) port, wired or wireless FPDP, deposit Card storage port, for connect the port of device with identification module, audio input/output (I/O) port, video i/o port, Ear port etc..Interface unit 1108 can be used for receiving the input from external device (ED) (for example, data message, electric power etc. Deng) and one or more elements that the input received is transferred in audio playing apparatus 1100 or can be used in sound Data are transmitted between frequency playing device 1100 and external device (ED).

Memory 1109 can be used for storage software program and various data.Memory 1109 can mainly include storage program Area and storage data field, wherein, storing program area can storage program area, needed at least one function application program (such as Sound-playing function, image player function etc.) etc.；Storage data field can store uses created data (ratio according to mobile phone Such as voice data, phone directory) etc..In addition, memory 1109 can include high-speed random access memory, can also include non- Volatile memory, for example, at least a disk memory, flush memory device or other volatile solid-state parts.

Processor 1110 is the control centre of audio playing apparatus, is played using various interfaces and the whole audio of connection The various pieces of device, by running or performing the software program and/or module that are stored in memory 1109, and call and deposit The data in memory 1109 are stored up, perform the various functions and processing data of audio playing apparatus, are filled so as to be played to audio Put carry out integral monitoring.Processor 1110 may include one or more processing units；Preferably, processor 1110 can integrate application Processor and modem processor, wherein, application processor mainly handles operating system, user interface and application program etc., Modem processor mainly handles radio communication.It is understood that above-mentioned modem processor can not also be integrated into In processor 1110.

Audio playing apparatus 1100 can also include the power supply 1111 (such as battery) to all parts power supply, it is preferred that Power supply 1111 can be logically contiguous by power-supply management system and processor 1110, is managed so as to be realized by power-supply management system The functions such as charging, electric discharge and power managed.

In addition, audio playing apparatus 1100 includes some unshowned functional modules, will not be repeated here.

Preferably, the embodiment of the present invention also provides a kind of audio playing apparatus, including processor 1110, memory 1109, The computer program that can be run on memory 1109 and on the processor 1110 is stored in, the computer program is by processor 1110 realize each process of above-mentioned audio frequency playing method embodiment when performing, and can reach identical technique effect, to avoid Repeat, repeat no more here.

The embodiment of the present invention also provides a kind of computer-readable recording medium, and meter is stored with computer-readable recording medium Calculation machine program, the computer program realize each process of above-mentioned audio frequency playing method embodiment, and energy when being executed by processor Reach identical technique effect, to avoid repeating, repeat no more here.Wherein, described computer-readable recording medium, such as only Read memory (Read-Only Memory, abbreviation ROM), random access memory (Random Access Memory, abbreviation RAM), magnetic disc or CD etc..

It should be noted that herein, term " comprising ", "comprising" or its any other variant are intended to non-row His property includes, so that process, method, article or device including a series of elements not only include those key elements, and And also include the other element being not expressly set out, or also include for this process, method, article or device institute inherently Key element.In the absence of more restrictions, the key element limited by sentence "including a ...", it is not excluded that including this Other identical element also be present in the process of key element, method, article or device.

Through the above description of the embodiments, those skilled in the art can be understood that above-described embodiment side Method can add the mode of required general hardware platform to realize by software, naturally it is also possible to by hardware, but in many cases The former is more preferably embodiment.Based on such understanding, technical scheme is substantially done to prior art in other words Going out the part of contribution can be embodied in the form of software product, and the computer software product is stored in a storage medium In (such as ROM/RAM, magnetic disc, CD), including some instructions to cause a station terminal (can be mobile phone, computer, service Device, air conditioner, or network equipment etc.) perform method described in each embodiment of the present invention.

The foregoing is only a specific embodiment of the invention, but protection scope of the present invention is not limited thereto, any Those familiar with the art the invention discloses technical scope in, change or replacement can be readily occurred in, should all be contained Cover within protection scope of the present invention.Therefore, protection scope of the present invention should be defined by scope of the claims.

Claims

1. a kind of audio frequency playing method, applied to audio playing apparatus, it is characterised in that methods described includes：

The target sound frequency range is skipped when playing the target audio.

2. audio frequency playing method as claimed in claim 1, it is characterised in that described to skip institute when playing the target audio Target sound frequency range is stated, including：

The target sound frequency range is skipped when playing the target audio, and is marked in the playing progress bar of the target audio The position of the target sound frequency range.

3. audio frequency playing method as claimed in claim 1, it is characterised in that the parsing target audio, and determine the mesh The audio section for not including voice in mark with phonetic symbols frequency is target sound frequency range, including：

Parse the sound wave spectrum of target audio；

Blank audio section in the target audio is identified according to the sound wave spectrum of the target audio, and determines the target sound Blank audio section in frequency is target sound frequency range.

4. audio frequency playing method as claimed in claim 3, it is characterised in that the blank sound determined in the target audio Frequency range is target sound frequency range, including：

5. audio frequency playing method as claimed in claim 3, it is characterised in that the blank audio section is that wave-shape amplitude value is less than The audio section of predetermined threshold value.

6. audio frequency playing method as claimed in claim 1, it is characterised in that the parsing target audio, and determine the mesh The audio section for not including voice in mark with phonetic symbols frequency is target sound frequency range, including：

Voiceprint analysis are carried out to target audio；

7. the audio frequency playing method as described in any one of claim 1 to 6, it is characterised in that the parsing target audio, and really After the step of audio section for not including voice in the fixed target audio is target sound frequency range, methods described also includes：

8. the audio frequency playing method as described in any one of claim 1 to 6, it is characterised in that methods described also includes：

9. a kind of audio playing apparatus, it is characterised in that the audio playing apparatus includes：

Determining module, for parsing target audio, and the audio section for determining not include voice in the target audio is target sound Frequency range；

10. audio playing apparatus as claimed in claim 9, it is characterised in that the control module, be specifically used for：

11. audio playing apparatus as claimed in claim 9, it is characterised in that the determining module, including：

Resolution unit, for parsing the sound wave spectrum of target audio；

First determining unit, for identifying the blank audio in the target audio according to the sound wave spectrum of the target audio Section, and determine that the blank audio section in the target audio is target sound frequency range.

12. audio playing apparatus as claimed in claim 11, it is characterised in that first determining unit, be specifically used for：

13. audio playing apparatus as claimed in claim 11, it is characterised in that the blank audio section is that wave-shape amplitude value is small In the audio section of predetermined threshold value.

14. audio playing apparatus as claimed in claim 9, it is characterised in that the determining module, including：

Analytic unit, for carrying out voiceprint analysis to target audio；

Recognition unit, for identifying the sound wave matched in the target audio with voice vocal print feature using sound groove recognition technology in e；

Second determining unit, for determining in the target audio not including the audio section of the sound wave matched with voice vocal print feature For target sound frequency range.

15. the audio playing apparatus as described in any one of claim 9 to 14, it is characterised in that the audio playing apparatus is also Including：

First identification module, for identify in the target audio with the unmatched sound wave of voice vocal print feature；

Remove module, for remove in the target audio with the unmatched sound wave of voice vocal print feature.

16. the audio playing apparatus as described in any one of claim 9 to 14, it is characterised in that the audio playing apparatus is also Including：

Second identification module, for identifying the sound wave matched in the target audio with default vocal print feature；

Playing module, for the audio section for the sound wave for playing the target audio to include to match with default vocal print feature.

17. a kind of audio playing apparatus, it is characterised in that including processor, memory is stored on the memory and can be The computer program run on the processor, the computer program are realized such as claim 1 during the computing device The step of to audio frequency playing method any one of 8.