CN105469656A - A spoken language learning system and its operation method - Google Patents
A spoken language learning system and its operation method Download PDFInfo
- Publication number
- CN105469656A CN105469656A CN201510821973.6A CN201510821973A CN105469656A CN 105469656 A CN105469656 A CN 105469656A CN 201510821973 A CN201510821973 A CN 201510821973A CN 105469656 A CN105469656 A CN 105469656A
- Authority
- CN
- China
- Prior art keywords
- module
- audio
- recording
- voice
- learning system
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims description 13
- 230000003044 adaptive effect Effects 0.000 claims description 14
- 238000001514 detection method Methods 0.000 claims description 8
- 238000004364 calculation method Methods 0.000 claims description 4
- 238000012545 processing Methods 0.000 claims description 4
- 238000012549 training Methods 0.000 abstract description 7
- 238000012790 confirmation Methods 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 230000002452 interceptive effect Effects 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000002789 length control Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B5/00—Electrically-operated educational appliances
- G09B5/04—Electrically-operated educational appliances with audible presentation of the material to be studied
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Physics & Mathematics (AREA)
- Educational Administration (AREA)
- Educational Technology (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Electrically Operated Instructional Devices (AREA)
Abstract
Description
技术领域technical field
本发明涉及一种口语学习系统,以及该系统的运作方法。The invention relates to a spoken language learning system and an operation method of the system.
背景技术Background technique
英语口语的学习需要通过反复的听和说的训练,才能提高学习效率。而目前人们对于购买或下载的英语音频文件,一般都是使用单向的音频播放器,用户只能训练自己听的能力,不能及时的训练自己说的能力。The learning of spoken English needs repeated listening and speaking training in order to improve learning efficiency. At present, people generally use a one-way audio player for the English audio files purchased or downloaded. Users can only train their ability to listen, but cannot train their ability to speak in time.
有鉴于此,确有必要提供一种系统可以让用户利用常用的音频文件或网络音频实现听、说、确认的综合训练,以提高学习效率。In view of this, it is indeed necessary to provide a system that allows users to use commonly used audio files or network audio to implement comprehensive training in listening, speaking, and confirmation, so as to improve learning efficiency.
发明内容Contents of the invention
本发明为了解决上述问题而提供的一种口语学习系统,包括:用于音频文件解码的音频解码模块,用于自动计算寻找音频中语音断点的语音中断点搜索模块,用于播放和回放音频数据的音频播放模块,用于自适应地录制用户的语音的自适应录音模块,以及用于回放录音的录音回放模块,所述音频解码模块与所述语音中断点搜索模块连接,所述语音中断点搜索模块与所述音频播放模块连接。A kind of spoken language learning system that the present invention provides in order to solve the above-mentioned problem, comprises: the audio frequency decoding module that is used for audio file decoding, is used for automatic calculation and seeks the speech interruption point search module of speech breakpoint in audio frequency, is used for playing and replaying audio frequency An audio playback module for data, an adaptive recording module for adaptively recording the user's voice, and a recording playback module for playing back recordings, the audio decoding module is connected with the voice interruption point search module, and the voice interruption The point search module is connected with the audio playing module.
优选地,所述音频解码模块支持MP3或MVA等音频文件或在线音频流的解码。Preferably, the audio decoding module supports decoding of audio files such as MP3 or MVA or online audio streams.
优选地,所述音频解码模块支持每次读取任意长度的解码数据。Preferably, the audio decoding module supports reading decoding data of any length each time.
优选地,所述自适应录音模块具有支持语音降噪处理的降噪模块。Preferably, the adaptive recording module has a noise reduction module supporting speech noise reduction processing.
优选地,所述自适应录音模块将语音保存至一个录音文件MicFile中,所述录音回放模块可以自动触发所述录音文件MicFile。Preferably, the adaptive recording module saves the voice in a recording file MicFile, and the recording playback module can automatically trigger the recording file MicFile.
本发明还提供一种上述语学习系统的运作方法,所述运作方法包括:The present invention also provides an operation method of the above-mentioned language learning system, the operation method comprising:
步骤1、音频解码模块对音频文件解码;Step 1, the audio decoding module decodes the audio file;
步骤2、语音中断点搜索模块自动计算寻找音频中的语音中断点;Step 2, the voice interruption point search module automatically calculates and finds the voice interruption point in the audio;
步骤3、音频播放模块播放和回放音频数据;Step 3, the audio playback module plays and replays audio data;
步骤4、自适应录音模块自适应地录制用户的语音;Step 4, the adaptive recording module adaptively records the voice of the user;
步骤5、录音回放模块回放用户的录音。Step 5. The recording playback module plays back the user's recording.
优选地,在所述步骤2中,所述语音中断点搜索模块基于整个音频数据缓存或基于部分数据流自动计算寻找解码后的数据流里面的语音中断点。Preferably, in the step 2, the speech interruption point search module searches for the speech interruption point in the decoded data stream based on the entire audio data buffer or based on automatic calculation of part of the data stream.
优选地,在所述步骤2中,所述语音中断点搜索模块使用能量门限语音断点检测算法。Preferably, in the step 2, the speech break point search module uses an energy threshold speech break point detection algorithm.
优选地,在所述步骤4中,如果持续体格第一时间长度T1内没有出现有效语音,则自动结束录制;如果第一时间长度T1内出现有效语音,则进入静音段判断,如果持续一个第二时间长度T2出现静音段,则自动结束录制。Preferably, in said step 4, if there is no effective voice in the first time length T1, then automatically end the recording; if there is an effective voice in the first time length T1, then enter the silent section judgment, if last a second 2. If there is a silent segment in the time length T2, the recording will end automatically.
优选地,在所述步骤5之后,还包括以下步骤:Preferably, after said step 5, the following steps are also included:
步骤6、所述音频解码模块和语音中断点搜索模块进行后续的数据解码和断点检测。Step 6. The audio decoding module and the voice breakpoint search module perform subsequent data decoding and breakpoint detection.
本发明的有益效果在于:该口语学习系统有效地解决了英语学习中同时交互式训练听和说的能力的问题。只要基于普通的音频文件或网络音频流就可以实现收听、复述、确认的逐句循环训练,另外支持单句重复播放功能,可以显著提高口语的学习效率。The beneficial effect of the present invention is that: the spoken language learning system effectively solves the problem of simultaneous interactive training of listening and speaking abilities in English learning. As long as it is based on ordinary audio files or network audio streams, it can realize the sentence-by-sentence cycle training of listening, repeating, and confirming. In addition, it supports the single-sentence repeat playback function, which can significantly improve the learning efficiency of oral English.
附图说明Description of drawings
图1为本发明实施例提供的学习系统框架示意图。FIG. 1 is a schematic diagram of a learning system framework provided by an embodiment of the present invention.
具体实施方式detailed description
下面结合附图对本发明作进一步阐述:The present invention will be further elaborated below in conjunction with accompanying drawing:
本发明提供一种口语学习系统。该口语学习系统的输入对象为音频文件,其中主要是语音为主,不包括持续的背景音乐。The invention provides a spoken language learning system. The input object of the spoken language learning system is audio files, which are mainly speech and do not include continuous background music.
如图1所示,该口语学习系统包括音频解码模块,用于音频文件的解码;语音中断点搜索模块,用于自动计算寻找音频中的语音中断点;音频播放模块,用于播放和回放音频数据;自适应录音模块,用于自适应地录制用户的语音;录音回放模块,用于回放用户的录音。As shown in Figure 1, this spoken language learning system comprises audio decoding module, is used for the decoding of audio file; Speech interruption point search module, is used for automatic calculation and finds the speech interruption point in audio frequency; Audio playback module, is used for playing and replaying audio frequency data; an adaptive recording module for adaptively recording the user's voice; a recording playback module for playing back the user's recording.
音频解码模块与语音中断点搜索模块连接,将解码后的解码数据流传输至语音中断点搜索模块。语音中断点搜索模块与音频播放模块连接,将语音片段的数据传递给音频播放模块。The audio decoding module is connected with the voice interruption point search module, and transmits the decoded decoded data stream to the voice interruption point search module. The voice interruption point search module is connected with the audio playback module, and transmits the data of the voice segment to the audio playback module.
本发明还提供上述口语学习系统的运作方法,包括以下步骤:The present invention also provides the operation method of the above-mentioned spoken language learning system, comprising the following steps:
步骤1、音频解码模块对音频文件解码;Step 1, the audio decoding module decodes the audio file;
步骤2、语音中断点搜索模块自动计算寻找音频中的语音中断点;Step 2, the voice interruption point search module automatically calculates and finds the voice interruption point in the audio;
步骤3、音频播放模块播放和回放音频数据;Step 3, the audio playback module plays and replays audio data;
步骤4、自适应录音模块自适应地录制用户的语音;Step 4, the adaptive recording module adaptively records the voice of the user;
步骤5、录音回放模块回放用户的录音。Step 5. The recording playback module plays back the user's recording.
音频解码模块支持MP3或MVA等音频文件的解码处理,也支持在线音频流的解码,并支持每次读取任意长度的解码数据。针对不同的平台,可以选择合适的缓存大小,每次读取合适长度的解码数据PcmData。The audio decoding module supports the decoding processing of audio files such as MP3 or MVA, and also supports the decoding of online audio streams, and supports reading decoded data of any length each time. For different platforms, you can choose an appropriate cache size, and read the decoded data PcmData of an appropriate length each time.
语音中断点搜索模块可以基于整个音频数据缓存,也可以基于部分数据流自动计算寻找解码后的数据流里面的语音中断点,使用算法包括但不限于常用的能量门限语音断点检测等算法。如:基于前面得到的解码数据PcmData,以20ms或40ms帧为单位进行语音能量和过零率的计算,然后通过滑窗和门限判决,判断是否存在语音中断点。如果存在语音中断点,则记录断点信息,并在音频播放模块播放语音片段后启动录音模块。如果不存在语音中断点,则直接把数据传递给音频播放模块播放语音。The speech interruption point search module can be based on the entire audio data cache, or can automatically calculate and find the speech interruption point in the decoded data stream based on part of the data stream. The algorithm used includes but is not limited to commonly used energy threshold speech breakpoint detection algorithms. For example: based on the previously obtained decoded data PcmData, calculate speech energy and zero-crossing rate in units of 20ms or 40ms frames, and then judge whether there is a speech interruption point through sliding window and threshold judgment. If there is a voice break point, then record the break point information, and start the recording module after the audio playback module plays the voice clip. If there is no voice interruption point, the data is directly passed to the audio playback module to play the voice.
语音播放模块接收到前面的数据后直接播放,如果没有数据则自动停止播放。语音播放模块可以播放前面语音中断点搜索模块输出的语音片段数据;也可以重复播放某个指定的语音片段。The voice playback module plays directly after receiving the previous data, and automatically stops playing if there is no data. The voice playback module can play the voice segment data output by the previous voice interruption point search module; it can also play a specified voice segment repeatedly.
自适应录音模块可以自适应控制录音时间长度把用户语音输入录制保存为音频文件,同时自适应录音模块具有降噪模块,支持语音降噪处理。其中自适应控制时长的算法包括但不限于语音端点检测、自适应静音段长度控制等。自适应录音模块收到启动指令后,启动录音处理,自适应录音模块缓存麦克风设备输出的数据MicData,保存到一个录音文件MicFile中,同时对数据MicData进行断点检测。如果持续第一时间长度T1内没有出现有效语音则自动结束录制。如果第一时间长度T1内出现有效语音,则进入静音段判断,如果持续第二时间长度T2出现静音段,则自动结束录制。录音接收后,自动启动录音回放模块。The self-adaptive recording module can self-adaptively control the length of recording time and save the recording of the user's voice input as an audio file. At the same time, the self-adaptive recording module has a noise reduction module to support speech noise reduction processing. The algorithms for adaptive control duration include but are not limited to voice endpoint detection, adaptive silence segment length control, and the like. After the adaptive recording module receives the start command, it starts the recording process. The adaptive recording module caches the data MicData output by the microphone device, saves it in a recording file MicFile, and performs breakpoint detection on the data MicData. If there is no valid voice within the first duration T1, the recording is automatically terminated. If a valid voice appears within the first time length T1, it will enter the silent segment judgment, and if the silent segment occurs for the second time length T2, the recording will be automatically ended. After the recording is received, the recording playback module is automatically started.
录音回放模块可以自动触发播放用户的录音文件MicFile,用于用户对自己复述语音的自我确认。录音回放模块收到指令后开始播放录音文件MicFile,播放完毕后,包括以下步骤:通知音频解码模块和语音中断点搜索模块进行后续的数据解码和断点检测。The recording playback module can automatically trigger the playback of the user's recording file MicFile, which is used for the user's self-confirmation of his repeated voice. The recording playback module starts to play the recording file MicFile after receiving the instruction. After the playback is completed, the following steps are included: notify the audio decoding module and the voice breakpoint search module to perform subsequent data decoding and breakpoint detection.
期间如果用户输入指令,则通知音频解码模块从前面保存的中断点位置开始解码数据。During this period, if the user inputs an instruction, the audio decoding module is notified to start decoding data from the previously saved breakpoint position.
该口语学习系统有效地解决了英语学习中同时交互式训练听和说的能力的问题。只要基于普通的音频文件或网络音频流就可以实现收听、复述、确认的逐句循环训练,另外支持单句重复播放功能,可以显著提高口语的学习效率。The spoken language learning system effectively solves the problem of simultaneous interactive training of listening and speaking abilities in English learning. As long as it is based on ordinary audio files or network audio streams, it can realize the sentence-by-sentence cycle training of listening, repeating, and confirming. In addition, it supports the single-sentence repeat playback function, which can significantly improve the learning efficiency of oral English.
以上所述实施例,只是本发明的较佳实例,并非来限制本发明的实施范围,故凡依本发明申请专利范围所述的构造、特征及原理所做的等效变化或修饰,均应包括于本发明专利申请范围内。The above-described embodiments are only preferred examples of the present invention, and are not intended to limit the scope of the present invention, so all equivalent changes or modifications made according to the structure, features and principles described in the patent scope of the present invention should be Included in the patent application scope of the present invention.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510821973.6A CN105469656A (en) | 2015-11-23 | 2015-11-23 | A spoken language learning system and its operation method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510821973.6A CN105469656A (en) | 2015-11-23 | 2015-11-23 | A spoken language learning system and its operation method |
Publications (1)
Publication Number | Publication Date |
---|---|
CN105469656A true CN105469656A (en) | 2016-04-06 |
Family
ID=55607296
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510821973.6A Pending CN105469656A (en) | 2015-11-23 | 2015-11-23 | A spoken language learning system and its operation method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105469656A (en) |
Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN2333049Y (en) * | 1998-06-09 | 1999-08-11 | 刘兆有 | Intelligence foreign language learning machine |
KR100470736B1 (en) * | 2002-08-08 | 2005-03-10 | 인벤텍 코오포레이션 | Language listening and speaking training system and method with random test, appropriate shadowing and instant paraphrase functions |
CN1624685A (en) * | 2003-12-02 | 2005-06-08 | 英业达股份有限公司 | Paragraph language learning system and method |
CN1787070A (en) * | 2005-12-09 | 2006-06-14 | 北京凌声芯语音科技有限公司 | Chip upper system for language learner |
KR20070092604A (en) * | 2006-03-10 | 2007-09-13 | 김태훈 | Listening, Speaking, and Writing through Memory Enhancement |
CN201465325U (en) * | 2009-05-11 | 2010-05-12 | 刘正江 | Multi-mode automatic integral-semantic sentence identification learning machine |
KR20100072627A (en) * | 2008-12-22 | 2010-07-01 | 심명은 | Language teaching method for adjusting height of voice |
JP2011085641A (en) * | 2009-10-13 | 2011-04-28 | Power Shift Inc | Language learning support system and language learning support method |
CN103413550A (en) * | 2013-08-30 | 2013-11-27 | 苏州跨界软件科技有限公司 | Man-machine interactive language learning system and method |
CN105006179A (en) * | 2015-05-29 | 2015-10-28 | 广东小天才科技有限公司 | Method and device for following and reading content of voice input |
-
2015
- 2015-11-23 CN CN201510821973.6A patent/CN105469656A/en active Pending
Patent Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN2333049Y (en) * | 1998-06-09 | 1999-08-11 | 刘兆有 | Intelligence foreign language learning machine |
KR100470736B1 (en) * | 2002-08-08 | 2005-03-10 | 인벤텍 코오포레이션 | Language listening and speaking training system and method with random test, appropriate shadowing and instant paraphrase functions |
CN1624685A (en) * | 2003-12-02 | 2005-06-08 | 英业达股份有限公司 | Paragraph language learning system and method |
CN1787070A (en) * | 2005-12-09 | 2006-06-14 | 北京凌声芯语音科技有限公司 | Chip upper system for language learner |
KR20070092604A (en) * | 2006-03-10 | 2007-09-13 | 김태훈 | Listening, Speaking, and Writing through Memory Enhancement |
KR20100072627A (en) * | 2008-12-22 | 2010-07-01 | 심명은 | Language teaching method for adjusting height of voice |
CN201465325U (en) * | 2009-05-11 | 2010-05-12 | 刘正江 | Multi-mode automatic integral-semantic sentence identification learning machine |
JP2011085641A (en) * | 2009-10-13 | 2011-04-28 | Power Shift Inc | Language learning support system and language learning support method |
CN103413550A (en) * | 2013-08-30 | 2013-11-27 | 苏州跨界软件科技有限公司 | Man-machine interactive language learning system and method |
CN105006179A (en) * | 2015-05-29 | 2015-10-28 | 广东小天才科技有限公司 | Method and device for following and reading content of voice input |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP6740504B1 (en) | Utterance classifier | |
CN104464723B (en) | A kind of voice interactive method and system | |
TWI659409B (en) | Speech point detection method and speech recognition method | |
CN108010515B (en) | A voice endpoint detection and wake-up method and device | |
CN108320733B (en) | Voice data processing method and device, storage medium and electronic equipment | |
RU2439716C2 (en) | Detection of telephone answering machine by voice recognition | |
CN110148402A (en) | Method of speech processing, device, computer equipment and storage medium | |
WO2017084360A1 (en) | Method and system for speech recognition | |
WO2017012511A1 (en) | Voice control method and device, and projector apparatus | |
WO2012055113A1 (en) | Method and system for endpoint automatic detection of audio record | |
US20130066630A1 (en) | Audio transcription generator and editor | |
JP2017021125A5 (en) | Voice dialogue apparatus and voice dialogue method | |
JP5051882B2 (en) | Voice dialogue apparatus, voice dialogue method, and robot apparatus | |
CN104021789A (en) | Self-adaption endpoint detection method using short-time time-frequency value | |
JP2014240940A (en) | Dictation support device, method and program | |
WO2016063879A1 (en) | Speech synthesis device and method | |
US20210118464A1 (en) | Method and apparatus for emotion recognition from speech | |
US8571873B2 (en) | Systems and methods for reconstruction of a smooth speech signal from a stuttered speech signal | |
CN116420188A (en) | Speech filtering of other speakers from call and audio messages | |
WO2013138122A2 (en) | Automatic realtime speech impairment correction | |
WO2023116243A1 (en) | Data conversion method and computer storage medium | |
JP6448950B2 (en) | Spoken dialogue apparatus and electronic device | |
JP2009175179A (en) | Speech recognition device, program and utterance signal extraction method | |
JP2006079089A (en) | Baseband modem for speech recognition and mobile communication terminal using baseband modem | |
JP5223843B2 (en) | Information processing apparatus and program |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20160406 |