CN107886969A - A kind of audio frequency playing method and audio playing apparatus - Google Patents
A kind of audio frequency playing method and audio playing apparatus Download PDFInfo
- Publication number
- CN107886969A CN107886969A CN201711159491.4A CN201711159491A CN107886969A CN 107886969 A CN107886969 A CN 107886969A CN 201711159491 A CN201711159491 A CN 201711159491A CN 107886969 A CN107886969 A CN 107886969A
- Authority
- CN
- China
- Prior art keywords
- audio
- target
- playing
- frequency range
- sound
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 81
- 230000001755 vocal effect Effects 0.000 claims description 55
- 238000001228 spectrum Methods 0.000 claims description 20
- 238000004458 analytical method Methods 0.000 claims description 10
- 238000005516 engineering process Methods 0.000 claims description 10
- 238000004590 computer program Methods 0.000 claims description 9
- 230000006870 function Effects 0.000 description 11
- 230000008569 process Effects 0.000 description 8
- 238000003860 storage Methods 0.000 description 7
- 230000006854 communication Effects 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 238000004891 communication Methods 0.000 description 3
- 238000010586 diagram Methods 0.000 description 3
- 230000007613 environmental effect Effects 0.000 description 2
- 239000004973 liquid crystal related substance Substances 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 230000003068 static effect Effects 0.000 description 2
- 230000001133 acceleration Effects 0.000 description 1
- 230000009471 action Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000001427 coherent effect Effects 0.000 description 1
- 238000005314 correlation function Methods 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000005484 gravity Effects 0.000 description 1
- 238000009434 installation Methods 0.000 description 1
- 210000000867 larynx Anatomy 0.000 description 1
- 210000004072 lung Anatomy 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 210000003928 nasal cavity Anatomy 0.000 description 1
- 238000009527 percussion Methods 0.000 description 1
- 230000035479 physiological effects, processes and functions Effects 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 230000033764 rhythmic process Effects 0.000 description 1
- 230000005236 sound signal Effects 0.000 description 1
- 238000010897 surface acoustic wave method Methods 0.000 description 1
- 210000002105 tongue Anatomy 0.000 description 1
- 210000000515 tooth Anatomy 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- Theoretical Computer Science (AREA)
- Signal Processing (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- General Health & Medical Sciences (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Circuit For Audible Band Transducer (AREA)
- Telephone Function (AREA)
Abstract
The present invention, which provides a kind of audio frequency playing method and audio playing apparatus, wherein method, to be included:Target audio is parsed, and the audio section for determining not include voice in the target audio is target sound frequency range;The target sound frequency range is skipped when playing the target audio.The embodiment of the present invention can identify the audio section for not including voice in target audio, and the audio section for not including voice is skipped when playing the target audio, can effectively improve audio playing efficiency, provide the user with conveniently.
Description
Technical field
The present invention relates to multimedia technology field, more particularly to a kind of audio frequency playing method and audio playing apparatus.
Background technology
One of major way recorded as people's record information, it can go back prime information exactly, be provided very to people
Big convenience, such as minutes speed when not catching up with meeting process during meeting, recorded when can be in session, and after the meeting
Listen to recording file and further improve minutes.Under many circumstances, there is discontinuity in the content of recording, such as record words
It can cause to pause because of thinking or other reasonses during language, so as to produce the audio section for not including voice, these do not include voice
Audio section can also take reproduction time in recording broadcasting, but not have practical significance, waste user time;Although user can adjust
Whole playing progress bar, but due to the play position that user does not know the audio section for not including voice, easily miss useful information.
It can be seen that in the prior art, audio, which plays, can not precisely skip the audio section for not including voice, broadcasting effect is reduced
Rate, made troubles to user.
The content of the invention
The embodiment of the present invention provides a kind of audio frequency playing method and audio playing apparatus, is broadcast with solving prior art sound intermediate frequency
The audio section for not including voice can not precisely be skipped by putting, and reduce playing efficiency, the problem of being made troubles to user.
In order to solve the above-mentioned technical problem, the present invention is realized in:A kind of audio frequency playing method, broadcasts applied to audio
Device is put, including:
Target audio is parsed, and the audio section for determining not include voice in the target audio is target sound frequency range;
The target sound frequency range is skipped when playing the target audio.
In a first aspect, the embodiments of the invention provide a kind of audio frequency playing method, applied to audio playing apparatus, including:
Target audio is parsed, and the audio section for determining not include voice in the target audio is target sound frequency range;
The target sound frequency range is skipped when playing the target audio.
Second aspect, the embodiments of the invention provide a kind of audio playing apparatus, including:
Determining module, for parsing target audio, and the audio section for determining not include voice in the target audio is mesh
Mark with phonetic symbols frequency range;
Control module, for skipping the target sound frequency range when playing the target audio.
The third aspect, the embodiments of the invention provide another audio playing apparatus, including processor, memory, storage
On the memory and the computer program that can run on the processor, the computer program are held by the processor
The step of above-mentioned audio frequency playing method is realized during row.
Fourth aspect, the embodiments of the invention provide a kind of computer-readable recording medium, the computer-readable storage
Computer program is stored with medium, the computer program realizes the step of above-mentioned audio frequency playing method when being executed by processor
Suddenly.
So, the embodiment of the present invention can identify the audio section for not including voice in target audio, and play the mesh
Being skipped during mark with phonetic symbols frequency does not include the audio section of voice, can effectively improve audio playing efficiency, provides the user with conveniently.
Brief description of the drawings
In order to illustrate the technical solution of the embodiments of the present invention more clearly, needed for being described below to the embodiment of the present invention
The accompanying drawing to be used is briefly described, it should be apparent that, drawings in the following description are only some embodiments of the present invention,
For those of ordinary skill in the art, without having to pay creative labor, can also be obtained according to these accompanying drawings
Take other accompanying drawings.
Fig. 1 is a kind of flow chart of audio frequency playing method provided in an embodiment of the present invention;
Fig. 2 is a kind of audio playing progress bar schematic diagram provided in an embodiment of the present invention;
Fig. 3 is the flow chart of another audio frequency playing method provided in an embodiment of the present invention;
Fig. 4 is a kind of sound wave spectrum schematic diagram of audio provided in an embodiment of the present invention;
Fig. 5 is the flow chart of another audio frequency playing method provided in an embodiment of the present invention;
Fig. 6 is a kind of structure chart of audio playing apparatus provided in an embodiment of the present invention;
Fig. 7 is the structure chart of another audio playing apparatus provided in an embodiment of the present invention;
Fig. 8 is the structure chart of another audio playing apparatus provided in an embodiment of the present invention;
Fig. 9 is the structure chart of another audio playing apparatus provided in an embodiment of the present invention;
Figure 10 is the structure chart of another audio playing apparatus provided in an embodiment of the present invention;
Figure 11 is the structure chart of another audio playing apparatus provided in an embodiment of the present invention.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, complete
Site preparation describes, it is clear that described embodiment is part of the embodiment of the present invention, rather than whole embodiments.Based on this hair
Embodiment in bright, those of ordinary skill in the art's every other implementation acquired under the premise of creative work is not made
Example, belongs to the scope of protection of the invention.
Referring to Fig. 1, Fig. 1 is a kind of flow chart of audio frequency playing method provided in an embodiment of the present invention, and the phonetic aspect of a dialect frequency is broadcast
Put method and be applied to an audio playing apparatus, as shown in figure 1, comprising the following steps:
Step 101, parsing target audio, and the audio section for determining not include voice in the target audio is target audio
Section.
In the step, methods described parsing target audio, and determine the audio section for not including voice in the target audio
For target sound frequency range.The target audio can be audio or audio frequency and video.Methods described parses the target audio can
To be the sound wave spectrum for parsing the target audio, and the sound for not including voice is identified according to the sound wave spectrum of the target audio
Frequency range is target sound frequency range.The audio section for not including voice can be blank audio section or only include removing voice
Outside other sound audio section.
Methods described identifies that the mode for the audio section for not including voice in the target audio can be specifically according to
The sound wave spectrum of target audio identifies the blank audio section in the target audio, it is then determined that the blank in the target audio
Audio section is target sound frequency range.Methods described can also carry out voiceprint analysis to the target audio, using vocal print technology of acoustic wave
The sound wave matched in the target audio with voice vocal print feature is identified, it is then determined that not including in the target audio and voice
The audio section of the sound wave of vocal print feature matching is target sound frequency range.
Step 102, skip the target sound frequency range when playing the target audio.
In the step, methods described skips the target sound frequency range when playing the target audio.For example, when with
When family plays session recording, if thering are some there is no speech content audio section, such as meeting time of having a rest or participant in recording
The recording of the think time of member, the section is not talked during methods described can directly skip session recording when playing session recording
The time of content, user can directly listen to coherent meeting speech content, can effectively save the time of user, improve audio
Playing efficiency.
It is understood that in some embodiments of the invention, methods described can not also directly skip the section and not talk
The time of content, and the position of the target sound frequency range is marked in the playing progress bar of the session recording, such as such as Fig. 2 institutes
Show, one group of bracket, one section of target sound frequency range of mark in the playing progress bar of target audio can be used, so, user can be with
Target sound frequency range is accurately skipped according to the position of the target sound frequency range of mark, audio playing efficiency is improved, provides the user with
It is convenient.
Alternatively, it is described to skip the target sound frequency range when playing the target audio, including:
The target sound frequency range is skipped when playing the target audio, and in the playing progress bar of the target audio
Mark the position of the target sound frequency range.
In the embodiment, methods described skips the target sound frequency range when playing the target audio, and in the mesh
The position of the target sound frequency range is marked in the playing progress bar of mark with phonetic symbols frequency.So, can not only skip automatically does not include voice
Target sound frequency range, moreover it is possible to allow user understand target sound frequency range particular location, provide the user with conveniently.
In the embodiment of the present invention, above-mentioned audio playing apparatus can be any electronic installation with audio playing function,
Such as:Mobile phone, tablet personal computer (Tablet Personal Computer), laptop computer (Laptop Computer), individual
Digital assistants (personal digital assistant, abbreviation PDA), mobile Internet access device (Mobile Internet
Device, MID) or wearable device (Wearable Device) etc..
In the present embodiment, methods described can identify the audio section for not including voice in target audio, and described in broadcasting
Being skipped during target audio does not include the audio section of voice, can effectively improve audio playing efficiency, provides the user with conveniently.
Referring to Fig. 3, Fig. 3 is the flow chart of another audio frequency playing method provided in an embodiment of the present invention, and methods described should
For an audio playing apparatus, as shown in figure 3, comprising the following steps:
Step 301, the sound wave spectrum for parsing target audio.
In the step, methods described obtains the sound wave spectrum of the target audio, and parses the sound wave of the target audio
Frequency spectrum.
Step 302, the blank audio section in the sound wave spectrum of the target audio identification target audio, and really
Blank audio section in the fixed target audio is target sound frequency range.
In the step, methods described identifies that the blank audio section in the target audio is target sound frequency range, for example,
When the sound wave spectrum of the target audio is frequency spectrum as shown in Figure 4, methods described can identify that the clear band in frequency spectrum is
Target sound frequency range, such as such as the target sound frequency range marked in Fig. 4.
Methods described can identify that the audio section that waveform range value is zero in the target audio is blank audio section, due to
In view of absolutely quiet environment may be not present during recording, methods described can also identify wave-shape amplitude in the target audio
Value is blank audio section less than the audio section of predetermined threshold value.Methods described can identify whole blank sounds in the target audio
Frequency range is target sound frequency range, can also identify that the blank audio section that blank duration exceedes preset duration in the target audio is mesh
Mark with phonetic symbols frequency range, as shown in Figure 4.
Step 303, skip the target sound frequency range when playing the target audio.
The step 303 is identical with the step 102 in the embodiment shown in Fig. 1 of the present invention, and here is omitted.
In the present embodiment, methods described identifies that the blank audio section in the target audio is target sound frequency range, and is broadcasting
The target sound frequency range is skipped when putting the target audio, audio playing efficiency can be effectively improved, is provided the user with conveniently.
Alternatively, the blank audio section determined in the target audio is target sound frequency range, including:
Determine that the blank audio section that blank duration exceedes preset duration in the target audio is target sound frequency range.
In the embodiment, methods described not can determine whether that all blank audio sections in the target audio are target audio
Section, and it is to determine that the blank audio section that blank duration exceedes preset duration in the target audio is target sound frequency range.So, no
It can skip and normally be paused during being talked in audio, keep the rhythm talked in target audio.
Alternatively, the blank audio section is the audio section that wave-shape amplitude value is less than predetermined threshold value.
In the embodiment, the blank audio section is the audio section that wave-shape amplitude value is less than predetermined threshold value, due to consideration that
Absolutely quiet environment may be not present during recording, methods described can be with allowable error, you can to determine waveform in target audio
The audio section that range value is less than predetermined threshold value is blank audio section.
Referring to Fig. 5, Fig. 5 is the flow chart of another audio frequency playing method provided in an embodiment of the present invention, and methods described should
For an audio playing apparatus, as shown in figure 5, comprising the following steps:
Step 501, voiceprint analysis are carried out to target audio.
In the step, methods described carries out voiceprint analysis to target audio, i.e., the sound wave spectrum of the target audio is entered
Row analysis, obtains the vocal print feature of the target audio.
Step 502, the sound wave matched in the target audio with voice vocal print feature is identified using sound groove recognition technology in e.
In the step, methods described is identified in the target audio using sound groove recognition technology in e and matched with voice vocal print feature
Sound wave.The audio playing apparatus can be prestored voice vocal print feature, and methods described will can be carried out to the target audio
The vocal print feature obtained after voiceprint analysis is matched with the voice vocal print feature to prestore, and identify in the target audio with people
The sound wave of several line characteristic matchings.It is understood that because the pronunciation of the mankind is produced by vocal organs, it, which has, is different from
The stability features of sound caused by object (i.e. thing sound), methods described can be according to the voice vocal print feature identifications to prestore
Voice sound wave and the several ripples of thing in target audio.
Step 503, the audio section for the sound wave for determining not include matching with voice vocal print feature in the target audio are mesh
Mark with phonetic symbols frequency range.
In the step, methods described is determined in the target audio not including the sound of the sound wave matched with voice vocal print feature
Frequency range is target sound frequency range, i.e., the audio section that methods described determines not include voice sound wave in the target audio is target audio
Section.
Step 504, skip the target sound frequency range when playing the target audio.
The step 504 is identical with the step 102 in the embodiment shown in Fig. 1 of the present invention, and here is omitted.
In the present embodiment, methods described carries out voiceprint analysis to target audio, and the mesh is identified using sound groove recognition technology in e
The sound wave matched in mark with phonetic symbols frequency with voice vocal print feature, and determine not include matching with voice vocal print feature in the target audio
The audio section of sound wave be target sound frequency range, skip the target sound frequency range when playing the target audio, can effectively carry
High audio playing efficiency, provide the user with conveniently.
Alternatively, the parsing target audio, and the audio section for determining not include voice in the target audio is target
After the step of audio section, methods described also includes:
Identify in the target audio with the unmatched sound wave of voice vocal print feature;
Remove in the target audio with the unmatched sound wave of voice vocal print feature.
In the step, methods described is also identified in the target audio with the unmatched sound wave of voice vocal print feature, and is moved
Except in the target and audio with the unmatched sound wave of voice vocal print feature.For example, when people talks under noisy environment, record
Sound content can include voice and environmental noise, because environmental noise disturbs user may be caused to hear during recording broadcasting
Voice in recording, leads to miss important information.In the embodiment, methods described can be identified and removed in the target audio
With the unmatched sound wave of voice vocal print feature, you can with and remove the noise sound wave identified in the target audio, so as to
Only retain the sound wave matched with voice vocal print feature in the target audio.For example, work as same in the 3rd to 5 second in an audio
When include sound wave A and sound wave B, wherein, sound wave A is the sound wave that match with voice vocal print feature, and sound wave B is and voice vocal print spy
Unmatched sound wave is levied, methods described can remove the sound wave B in the 3rd to 5 second, retain sound wave A.In such manner, it is possible to effectively remove
Noise in target audio, avoid noise from bringing interference to user, prevent user's miss critical information.
Alternatively, methods described also includes:
Identify the sound wave matched in the target audio with default vocal print feature;
Play the audio section of sound wave that the target audio includes matching with default vocal print feature.
In the embodiment, methods described identifies the sound wave matched in the target audio with default vocal print feature, and plays
The target audio includes the audio section of the sound wave matched with default vocal print feature.It is understood that the production of human language
Life be speech center vocal organs between a complicated physiology physical process, each acoustical generator that everybody uses in speech
Official's (such as tongue, tooth, larynx, lung, nasal cavity) has differences in terms of size and form, therefore, the vocal print point of different people
Do not take on a different character.
For example, when user only needs to listen to the speech content of meeting speaker in session recording, methods described can
To identify the sound wave matched in session recording with speaker's vocal print feature, and only broadcasting session recording includes and speaker's vocal print
The audio section of the sound wave of characteristic matching, so, user can need not only listen to the speech content of speaker in a meeting, can
Effectively save the time of user.
Referring to Fig. 6, Fig. 6 is a kind of structure chart of audio playing apparatus provided in an embodiment of the present invention, as shown in fig. 6, sound
Frequency playing device 600 includes:
Determining module 601, for parsing target audio, and the audio section for determining not include voice in the target audio is
Target sound frequency range;
Control module 602, for skipping the target sound frequency range when playing the target audio, or in the target
The position of the target sound frequency range is marked in the playing progress bar of audio.
Alternatively, the control module 602, is specifically used for:
The target sound frequency range is skipped when playing the target audio, and in the playing progress bar of the target audio
Mark the position of the target sound frequency range.
Alternatively, referring to Fig. 7, Fig. 7 is the structure chart of another audio playing apparatus provided in an embodiment of the present invention, is such as schemed
Shown in 7, the determining module 601, including:
Resolution unit 6011, for parsing the sound wave spectrum of target audio;
First determining unit 6012, for identifying the sky in the target audio according to the sound wave spectrum of the target audio
White tone frequency range, and determine that the blank audio section in the target audio is target sound frequency range.
Alternatively, first determining unit 6012, is specifically used for:
Determine that the blank audio section that blank duration exceedes preset duration in the target audio is target sound frequency range.
Alternatively, the blank audio section is the audio section that wave-shape amplitude value is less than predetermined threshold value.
Alternatively, referring to Fig. 8, Fig. 8 is the structure chart of another audio playing apparatus provided in an embodiment of the present invention, is such as schemed
Shown in 8, the determining module 601, including:
Analytic unit 6013, for carrying out voiceprint analysis to target audio;
Recognition unit 6014, matched for being identified using sound groove recognition technology in e in the target audio with voice vocal print feature
Sound wave;
Second determining unit 6015, for determining not include the sound wave matched with voice vocal print feature in the target audio
Audio section be target sound frequency range.
Alternatively, referring to Fig. 9, Fig. 9 is the structure chart of another audio playing apparatus provided in an embodiment of the present invention, is such as schemed
Shown in 9, the audio playing apparatus 600 also includes:
First identification module 603, for identify in the target audio with the unmatched sound wave of voice vocal print feature;
Remove module 604, for remove in the target audio with the unmatched sound wave of voice vocal print feature.
Alternatively, referring to Figure 10, Figure 10 is the structure chart of another audio playing apparatus provided in an embodiment of the present invention, such as
Shown in Figure 10, the audio playing apparatus 600 also includes:
Second identification module 605, for identifying the sound wave matched in the target audio with default vocal print feature;
Playing module 606, for the audio for the sound wave for playing the target audio to include to match with default vocal print feature
Section.
Audio playing apparatus 600 provided in an embodiment of the present invention can realize that Fig. 1 to Fig. 5 embodiment of the method sound intermediate frequency is broadcast
Each process of device realization is put, to avoid repeating, is repeated no more here.
In the embodiment of the present invention, the audio playing apparatus can identify the audio section for not including voice in target audio,
And the audio section for not including voice is skipped when playing the target audio, or in the playing progress bar of the target audio
Mark does not include the audio section of voice, facilitates user precisely to skip the audio section for not including voice, Neng Gouyou by the mark
Effect improves audio playing efficiency, provides the user with conveniently.
Figure 11 is a kind of hardware architecture diagram for the audio playing apparatus for realizing each embodiment of the present invention, and the audio is broadcast
Device 1100 is put to include but is not limited to:Radio frequency unit 1101, mixed-media network modules mixed-media 1102, audio output unit 1103, input block
1104th, sensor 1105, display unit 1106, user input unit 1007, interface unit 1108, memory 1109, processor
The part such as 1110 and power supply 1111.It will be understood by those skilled in the art that the audio playing apparatus structure shown in Figure 11 is simultaneously
The restriction to audio playing apparatus is not formed, and audio playing apparatus can be included than illustrating more or less parts, or group
Close some parts, or different parts arrangement.In embodiments of the present invention, audio playing apparatus include but is not limited to mobile phone,
Tablet personal computer, notebook computer, palm PC, car-mounted terminal, wearable device and pedometer etc..
Wherein, processor 1110, it is used for:
Target audio is parsed, and the audio section for determining not include voice in the target audio is target sound frequency range;
The target sound frequency range, or the playing progress bar in the target audio are skipped when playing the target audio
The position of the upper mark target sound frequency range.
Alternatively, what processor 1110 performed skips the target sound frequency range when playing the target audio, including:
The target sound frequency range is skipped when playing the target audio, and in the playing progress bar of the target audio
Mark the position of the target sound frequency range.
Alternatively, the parsing target audio that processor 1110 performs, and determine not include voice in the target audio
Audio section is target sound frequency range, including:
Parse the sound wave spectrum of target audio;
Blank audio section in the target audio is identified according to the sound wave spectrum of the target audio, and determines the mesh
Blank audio section in mark with phonetic symbols frequency is target sound frequency range.
Alternatively, the blank audio section in target audio described in the determination that the processor 1110 performs is target audio
Section, including:
Determine that the blank audio section that blank duration exceedes preset duration in the target audio is target sound frequency range.
Alternatively, the blank audio section is the audio section that wave-shape amplitude value is less than predetermined threshold value.
Alternatively, the parsing target audio that the processor 1110 performs, and determine not include people in the target audio
The audio section of sound is target sound frequency range, including:
Voiceprint analysis are carried out to target audio;
The sound wave matched in the target audio with voice vocal print feature is identified using sound groove recognition technology in e;
The audio section for the sound wave for determining not include matching with voice vocal print feature in the target audio is target sound frequency range.
Alternatively, the processor 1110 performs parsing target audio, and determines not include voice in the target audio
Audio section the step of being target sound frequency range after, can also realize following steps:
Identify in the target audio with the unmatched sound wave of voice vocal print feature;
Remove in the target audio with the unmatched sound wave of voice vocal print feature.
Alternatively, the processor 1110 can also realize following steps:
Identify the sound wave matched in the target audio with default vocal print feature;
Play the audio section of sound wave that the target audio includes matching with default vocal print feature.
In the embodiment of the present invention, audio playing apparatus can identify the audio section for not including voice in target audio, and
Being skipped when playing the target audio does not include the audio section of voice, can effectively improve audio playing efficiency, provide the user with
It is convenient.
It should be understood that in the embodiment of the present invention, radio frequency unit 1101 can be used for receiving and sending messages or communication process in, signal
Reception and transmission, specifically, by from base station downlink data receive after, handled to processor 1110;In addition, will be up
Data are sent to base station.Generally, radio frequency unit 1101 includes but is not limited to antenna, at least one amplifier, transceiver, coupling
Device, low-noise amplifier, duplexer etc..In addition, radio frequency unit 1101 can also pass through wireless communication system and network and other
Equipment communication.
Audio playing apparatus has provided the user wireless broadband internet by mixed-media network modules mixed-media 1102 and accessed, as helped to use
Send and receive e-mail, browse webpage and access streaming video etc. in family.
Audio output unit 1103 can be receiving by radio frequency unit 1101 or mixed-media network modules mixed-media 1102 or in memory
It is sound that the voice data stored in 1109, which is converted into audio signal and exported,.Moreover, audio output unit 1103 can be with
The audio output related to the specific function that audio playing apparatus 1100 performs is provided (for example, call signal receives sound, message
Receive sound etc.).Audio output unit 1103 includes loudspeaker, buzzer and receiver etc..
Input block 1104 is used to receive audio or video signal.Input block 1104 can include graphics processor
(Graphics Processing Unit, GPU) 11041 and microphone 11042, graphics processor 11041 in video to capturing
The static images or the view data of video obtained in pattern or image capture mode by image capture apparatus (such as camera) enter
Row processing.Picture frame after processing may be displayed on display unit 1106.Picture frame after the processing of graphics processor 11041
It can be stored in memory 1109 (or other storage mediums) or be carried out via radio frequency unit 1101 or mixed-media network modules mixed-media 1102
Send.Microphone 11042 can receive sound, and can be voice data by such acoustic processing.Audio after processing
Data can be converted to the lattice that mobile communication base station can be sent to via radio frequency unit 1101 in the case of telephone calling model
Formula exports.
Audio playing apparatus 1100 also includes at least one sensor 1105, for example, optical sensor, motion sensor and
Other sensors.Specifically, optical sensor includes ambient light sensor and proximity transducer, wherein, ambient light sensor can root
The brightness of display panel 11061 is adjusted according to the light and shade of ambient light, proximity transducer can move in audio playing apparatus 1100
When arriving in one's ear, display panel 11061 and/or backlight are closed.As one kind of motion sensor, accelerometer sensor is detectable each
The size of (generally three axles) acceleration, can detect that size and the direction of gravity on individual direction when static, available for identifying sound
Frequency playing device posture (such as horizontal/vertical screen switching, dependent game, magnetometer pose calibrating), Vibration identification correlation function (such as
Pedometer, percussion) etc.;Sensor 1105 can also include fingerprint sensor, pressure sensor, iris sensor, molecule sensing
Device, gyroscope, barometer, hygrometer, thermometer, infrared ray sensor etc., will not be repeated here.
Display unit 1106 is used for the information for showing the information inputted by user or being supplied to user.Display unit 1106 can
Including display panel 11061, liquid crystal display (Liquid Crystal Display, LCD), organic light-emitting diodes can be used
Forms such as (Organic Light-Emitting Diode, OLED) is managed to configure display panel 11061.
User input unit 1107 can be used for the numeral or character information for receiving input, and generation and audio playing apparatus
User set and function control it is relevant key signals input.Specifically, user input unit 1107 includes contact panel
11071 and other input equipments 11072.Contact panel 11071, also referred to as touch-screen, user is collected on or near it
Touch operation (for example user on contact panel 11071 or is being touched using any suitable object or annex such as finger, stylus
Control the operation near panel 11071).Contact panel 11071 may include both touch detecting apparatus and touch controller.Its
In, touch detecting apparatus detects the touch orientation of user, and detects the signal that touch operation is brought, and transmits a signal to touch control
Device processed;Touch controller receives touch information from touch detecting apparatus, and is converted into contact coordinate, then gives processor
1110, order that reception processing device 1110 is sent simultaneously is performed.Furthermore, it is possible to using resistance-type, condenser type, infrared ray and
The polytypes such as surface acoustic wave realize contact panel 11071.Except contact panel 11071, user input unit 1107 can be with
Including other input equipments 11072.Specifically, other input equipments 11072 can include but is not limited to physical keyboard, function key
(such as volume control button, switch key etc.), trace ball, mouse, action bars, will not be repeated here.
Further, contact panel 11071 can be covered on display panel 11061, when contact panel 11071 detects
After touch operation on or near it, processor 1110 is sent to determine the type of touch event, is followed by subsequent processing device 1110
Corresponding visual output is provided on display panel 11061 according to the type of touch event.Although in fig. 11, contact panel
11071 and display panel 11061 are the parts independent as two to realize the input of audio playing apparatus and output function, but
It is in some embodiments it is possible to which contact panel 11071 is integrated with display panel 11061 and realizes the defeated of audio playing apparatus
Enter and output function, do not limit herein specifically.
Interface unit 1108 is the interface that external device (ED) is connected with audio playing apparatus 1100.For example, external device (ED) can be with
Including wired or wireless head-band earphone port, external power source (or battery charger) port, wired or wireless FPDP, deposit
Card storage port, for connect the port of device with identification module, audio input/output (I/O) port, video i/o port,
Ear port etc..Interface unit 1108 can be used for receiving the input from external device (ED) (for example, data message, electric power etc.
Deng) and one or more elements that the input received is transferred in audio playing apparatus 1100 or can be used in sound
Data are transmitted between frequency playing device 1100 and external device (ED).
Memory 1109 can be used for storage software program and various data.Memory 1109 can mainly include storage program
Area and storage data field, wherein, storing program area can storage program area, needed at least one function application program (such as
Sound-playing function, image player function etc.) etc.;Storage data field can store uses created data (ratio according to mobile phone
Such as voice data, phone directory) etc..In addition, memory 1109 can include high-speed random access memory, can also include non-
Volatile memory, for example, at least a disk memory, flush memory device or other volatile solid-state parts.
Processor 1110 is the control centre of audio playing apparatus, is played using various interfaces and the whole audio of connection
The various pieces of device, by running or performing the software program and/or module that are stored in memory 1109, and call and deposit
The data in memory 1109 are stored up, perform the various functions and processing data of audio playing apparatus, are filled so as to be played to audio
Put carry out integral monitoring.Processor 1110 may include one or more processing units;Preferably, processor 1110 can integrate application
Processor and modem processor, wherein, application processor mainly handles operating system, user interface and application program etc.,
Modem processor mainly handles radio communication.It is understood that above-mentioned modem processor can not also be integrated into
In processor 1110.
Audio playing apparatus 1100 can also include the power supply 1111 (such as battery) to all parts power supply, it is preferred that
Power supply 1111 can be logically contiguous by power-supply management system and processor 1110, is managed so as to be realized by power-supply management system
The functions such as charging, electric discharge and power managed.
In addition, audio playing apparatus 1100 includes some unshowned functional modules, will not be repeated here.
Preferably, the embodiment of the present invention also provides a kind of audio playing apparatus, including processor 1110, memory 1109,
The computer program that can be run on memory 1109 and on the processor 1110 is stored in, the computer program is by processor
1110 realize each process of above-mentioned audio frequency playing method embodiment when performing, and can reach identical technique effect, to avoid
Repeat, repeat no more here.
The embodiment of the present invention also provides a kind of computer-readable recording medium, and meter is stored with computer-readable recording medium
Calculation machine program, the computer program realize each process of above-mentioned audio frequency playing method embodiment, and energy when being executed by processor
Reach identical technique effect, to avoid repeating, repeat no more here.Wherein, described computer-readable recording medium, such as only
Read memory (Read-Only Memory, abbreviation ROM), random access memory (Random Access Memory, abbreviation
RAM), magnetic disc or CD etc..
It should be noted that herein, term " comprising ", "comprising" or its any other variant are intended to non-row
His property includes, so that process, method, article or device including a series of elements not only include those key elements, and
And also include the other element being not expressly set out, or also include for this process, method, article or device institute inherently
Key element.In the absence of more restrictions, the key element limited by sentence "including a ...", it is not excluded that including this
Other identical element also be present in the process of key element, method, article or device.
Through the above description of the embodiments, those skilled in the art can be understood that above-described embodiment side
Method can add the mode of required general hardware platform to realize by software, naturally it is also possible to by hardware, but in many cases
The former is more preferably embodiment.Based on such understanding, technical scheme is substantially done to prior art in other words
Going out the part of contribution can be embodied in the form of software product, and the computer software product is stored in a storage medium
In (such as ROM/RAM, magnetic disc, CD), including some instructions to cause a station terminal (can be mobile phone, computer, service
Device, air conditioner, or network equipment etc.) perform method described in each embodiment of the present invention.
The foregoing is only a specific embodiment of the invention, but protection scope of the present invention is not limited thereto, any
Those familiar with the art the invention discloses technical scope in, change or replacement can be readily occurred in, should all be contained
Cover within protection scope of the present invention.Therefore, protection scope of the present invention should be defined by scope of the claims.
Claims (17)
1. a kind of audio frequency playing method, applied to audio playing apparatus, it is characterised in that methods described includes:
Target audio is parsed, and the audio section for determining not include voice in the target audio is target sound frequency range;
The target sound frequency range is skipped when playing the target audio.
2. audio frequency playing method as claimed in claim 1, it is characterised in that described to skip institute when playing the target audio
Target sound frequency range is stated, including:
The target sound frequency range is skipped when playing the target audio, and is marked in the playing progress bar of the target audio
The position of the target sound frequency range.
3. audio frequency playing method as claimed in claim 1, it is characterised in that the parsing target audio, and determine the mesh
The audio section for not including voice in mark with phonetic symbols frequency is target sound frequency range, including:
Parse the sound wave spectrum of target audio;
Blank audio section in the target audio is identified according to the sound wave spectrum of the target audio, and determines the target sound
Blank audio section in frequency is target sound frequency range.
4. audio frequency playing method as claimed in claim 3, it is characterised in that the blank sound determined in the target audio
Frequency range is target sound frequency range, including:
Determine that the blank audio section that blank duration exceedes preset duration in the target audio is target sound frequency range.
5. audio frequency playing method as claimed in claim 3, it is characterised in that the blank audio section is that wave-shape amplitude value is less than
The audio section of predetermined threshold value.
6. audio frequency playing method as claimed in claim 1, it is characterised in that the parsing target audio, and determine the mesh
The audio section for not including voice in mark with phonetic symbols frequency is target sound frequency range, including:
Voiceprint analysis are carried out to target audio;
The sound wave matched in the target audio with voice vocal print feature is identified using sound groove recognition technology in e;
The audio section for the sound wave for determining not include matching with voice vocal print feature in the target audio is target sound frequency range.
7. the audio frequency playing method as described in any one of claim 1 to 6, it is characterised in that the parsing target audio, and really
After the step of audio section for not including voice in the fixed target audio is target sound frequency range, methods described also includes:
Identify in the target audio with the unmatched sound wave of voice vocal print feature;
Remove in the target audio with the unmatched sound wave of voice vocal print feature.
8. the audio frequency playing method as described in any one of claim 1 to 6, it is characterised in that methods described also includes:
Identify the sound wave matched in the target audio with default vocal print feature;
Play the audio section of sound wave that the target audio includes matching with default vocal print feature.
9. a kind of audio playing apparatus, it is characterised in that the audio playing apparatus includes:
Determining module, for parsing target audio, and the audio section for determining not include voice in the target audio is target sound
Frequency range;
Control module, for skipping the target sound frequency range when playing the target audio.
10. audio playing apparatus as claimed in claim 9, it is characterised in that the control module, be specifically used for:
The target sound frequency range is skipped when playing the target audio, and is marked in the playing progress bar of the target audio
The position of the target sound frequency range.
11. audio playing apparatus as claimed in claim 9, it is characterised in that the determining module, including:
Resolution unit, for parsing the sound wave spectrum of target audio;
First determining unit, for identifying the blank audio in the target audio according to the sound wave spectrum of the target audio
Section, and determine that the blank audio section in the target audio is target sound frequency range.
12. audio playing apparatus as claimed in claim 11, it is characterised in that first determining unit, be specifically used for:
Determine that the blank audio section that blank duration exceedes preset duration in the target audio is target sound frequency range.
13. audio playing apparatus as claimed in claim 11, it is characterised in that the blank audio section is that wave-shape amplitude value is small
In the audio section of predetermined threshold value.
14. audio playing apparatus as claimed in claim 9, it is characterised in that the determining module, including:
Analytic unit, for carrying out voiceprint analysis to target audio;
Recognition unit, for identifying the sound wave matched in the target audio with voice vocal print feature using sound groove recognition technology in e;
Second determining unit, for determining in the target audio not including the audio section of the sound wave matched with voice vocal print feature
For target sound frequency range.
15. the audio playing apparatus as described in any one of claim 9 to 14, it is characterised in that the audio playing apparatus is also
Including:
First identification module, for identify in the target audio with the unmatched sound wave of voice vocal print feature;
Remove module, for remove in the target audio with the unmatched sound wave of voice vocal print feature.
16. the audio playing apparatus as described in any one of claim 9 to 14, it is characterised in that the audio playing apparatus is also
Including:
Second identification module, for identifying the sound wave matched in the target audio with default vocal print feature;
Playing module, for the audio section for the sound wave for playing the target audio to include to match with default vocal print feature.
17. a kind of audio playing apparatus, it is characterised in that including processor, memory is stored on the memory and can be
The computer program run on the processor, the computer program are realized such as claim 1 during the computing device
The step of to audio frequency playing method any one of 8.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711159491.4A CN107886969B (en) | 2017-11-20 | 2017-11-20 | Audio playing method and audio playing device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711159491.4A CN107886969B (en) | 2017-11-20 | 2017-11-20 | Audio playing method and audio playing device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107886969A true CN107886969A (en) | 2018-04-06 |
CN107886969B CN107886969B (en) | 2020-08-28 |
Family
ID=61777594
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711159491.4A Active CN107886969B (en) | 2017-11-20 | 2017-11-20 | Audio playing method and audio playing device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107886969B (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109379497A (en) * | 2018-12-28 | 2019-02-22 | 努比亚技术有限公司 | Voice messaging playback method, mobile terminal and computer readable storage medium |
CN109545246A (en) * | 2019-01-21 | 2019-03-29 | 维沃移动通信有限公司 | A kind of sound processing method and terminal device |
CN109994126A (en) * | 2019-03-11 | 2019-07-09 | 北京三快在线科技有限公司 | Audio message segmentation method, device, storage medium and electronic equipment |
CN112788187A (en) * | 2020-12-25 | 2021-05-11 | 北京百度网讯科技有限公司 | Audio data playing method, device, equipment, storage medium, program and terminal |
CN114005469A (en) * | 2021-10-20 | 2022-02-01 | 广州市网星信息技术有限公司 | Audio playing method and system capable of automatically skipping mute segment |
WO2022028227A1 (en) * | 2020-08-03 | 2022-02-10 | 北京字跳网络技术有限公司 | Information display method and device |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101098439A (en) * | 2006-06-27 | 2008-01-02 | 明基电通股份有限公司 | Recording/reproducing apparatus and recording/reproducing method |
CN101582285A (en) * | 2009-07-02 | 2009-11-18 | 福州思迈特数码科技有限公司 | High-effective record playing method |
CN104157301A (en) * | 2014-07-25 | 2014-11-19 | 广州三星通信技术研究有限公司 | Method, device and terminal deleting voice information blank segment |
EP2887698A1 (en) * | 2013-12-23 | 2015-06-24 | Samsung Electronics Co., Ltd | Hearing aid for playing audible advertisement or audible data |
CN106935253A (en) * | 2017-03-10 | 2017-07-07 | 北京奇虎科技有限公司 | The method of cutting out of audio file, device and terminal device |
CN107111646A (en) * | 2014-11-03 | 2017-08-29 | 开放电视公司 | The media presentation marked using audio fragment is changed |
-
2017
- 2017-11-20 CN CN201711159491.4A patent/CN107886969B/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101098439A (en) * | 2006-06-27 | 2008-01-02 | 明基电通股份有限公司 | Recording/reproducing apparatus and recording/reproducing method |
CN101582285A (en) * | 2009-07-02 | 2009-11-18 | 福州思迈特数码科技有限公司 | High-effective record playing method |
EP2887698A1 (en) * | 2013-12-23 | 2015-06-24 | Samsung Electronics Co., Ltd | Hearing aid for playing audible advertisement or audible data |
CN104157301A (en) * | 2014-07-25 | 2014-11-19 | 广州三星通信技术研究有限公司 | Method, device and terminal deleting voice information blank segment |
CN107111646A (en) * | 2014-11-03 | 2017-08-29 | 开放电视公司 | The media presentation marked using audio fragment is changed |
CN106935253A (en) * | 2017-03-10 | 2017-07-07 | 北京奇虎科技有限公司 | The method of cutting out of audio file, device and terminal device |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109379497A (en) * | 2018-12-28 | 2019-02-22 | 努比亚技术有限公司 | Voice messaging playback method, mobile terminal and computer readable storage medium |
CN109545246A (en) * | 2019-01-21 | 2019-03-29 | 维沃移动通信有限公司 | A kind of sound processing method and terminal device |
CN109994126A (en) * | 2019-03-11 | 2019-07-09 | 北京三快在线科技有限公司 | Audio message segmentation method, device, storage medium and electronic equipment |
WO2022028227A1 (en) * | 2020-08-03 | 2022-02-10 | 北京字跳网络技术有限公司 | Information display method and device |
US12093313B2 (en) | 2020-08-03 | 2024-09-17 | Beijing Zitiao Network Technology Co., Ltd. | Information displaying method and device |
CN112788187A (en) * | 2020-12-25 | 2021-05-11 | 北京百度网讯科技有限公司 | Audio data playing method, device, equipment, storage medium, program and terminal |
CN114005469A (en) * | 2021-10-20 | 2022-02-01 | 广州市网星信息技术有限公司 | Audio playing method and system capable of automatically skipping mute segment |
Also Published As
Publication number | Publication date |
---|---|
CN107886969B (en) | 2020-08-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107886969A (en) | A kind of audio frequency playing method and audio playing apparatus | |
CN107799125A (en) | A kind of audio recognition method, mobile terminal and computer-readable recording medium | |
CN107734378A (en) | A kind of audio and video synchronization method, device and mobile terminal | |
CN111638779A (en) | Audio playing control method and device, electronic equipment and readable storage medium | |
CN107864353B (en) | A kind of video recording method and mobile terminal | |
CN108289244A (en) | Video caption processing method, mobile terminal and computer readable storage medium | |
CN108874357A (en) | A kind of reminding method and mobile terminal | |
CN107484081A (en) | A kind of method of adjustment of audio signal, device, terminal and computer-readable recording medium | |
CN108418948A (en) | A kind of based reminding method, mobile terminal and computer storage media | |
CN110177296A (en) | A kind of video broadcasting method and mobile terminal | |
CN107506385A (en) | A kind of video file retrieval method, equipment and computer-readable recording medium | |
CN108712566A (en) | A kind of voice assistant awakening method and mobile terminal | |
CN108391190B (en) | A kind of noise-reduction method, earphone and computer readable storage medium | |
CN107678829A (en) | A kind of application control method and mobile terminal | |
CN109194899A (en) | A kind of method and terminal of audio-visual synchronization | |
CN108650402A (en) | A kind of method of Anti-addiction, equipment and computer readable storage medium | |
CN108551534A (en) | The method and device of multiple terminals voice communication | |
CN107229389A (en) | A kind of method of shared file, equipment and computer-readable recording medium | |
CN108989558A (en) | The method and device of terminal call | |
CN107798107A (en) | The method and mobile device of song recommendations | |
CN108307043A (en) | Speech message conversion method, mobile terminal and computer readable storage medium | |
CN107403623A (en) | Store method, terminal, Cloud Server and the readable storage medium storing program for executing of recording substance | |
CN109545246A (en) | A kind of sound processing method and terminal device | |
CN108541116A (en) | Lamp light control method, terminal and computer readable storage medium | |
CN112382282A (en) | Voice denoising processing method and device, electronic equipment and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |