WO2019015613A1 - Electronic-book voice playback method, apparatus, and terminal device - Google Patents
Electronic-book voice playback method, apparatus, and terminal device Download PDFInfo
- Publication number
- WO2019015613A1 WO2019015613A1 PCT/CN2018/096162 CN2018096162W WO2019015613A1 WO 2019015613 A1 WO2019015613 A1 WO 2019015613A1 CN 2018096162 W CN2018096162 W CN 2018096162W WO 2019015613 A1 WO2019015613 A1 WO 2019015613A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- audio
- content
- book
- voice
- played
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B20/00—Signal processing not specific to the method of recording or reproducing; Circuits therefor
- G11B20/10—Digital recording or reproducing
- G11B20/10527—Audio or video recording; Data buffering arrangements
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B20/00—Signal processing not specific to the method of recording or reproducing; Circuits therefor
- G11B20/10—Digital recording or reproducing
- G11B20/10527—Audio or video recording; Data buffering arrangements
- G11B2020/10537—Audio or video recording
- G11B2020/10546—Audio or video recording specifically adapted for audio data
Definitions
- Embodiments of the present invention relate to the field of electronic book data processing technologies, and in particular, to an electronic book voice playing method, apparatus, and terminal device.
- An e-book is a publication that digitizes information such as text, pictures, sounds, and images using computer technology.
- traditional paper reading methods have gradually been replaced by e-books.
- People are increasingly using Internet and computer technology to download e-books through e-book reading applications for reading e-books. Read it.
- the embodiments of the present invention provide a method, a device, and a terminal device for playing an e-book voice, so as to solve the problem that the user reads the e-book under the condition of eye fatigue or poor light.
- a method for playing an e-book voice includes: determining an e-book content to be played by a voice according to a voice play instruction for instructing an e-book to perform voice playback; obtaining the e-book The content corresponds to the real vocal audio and plays the real vocal audio.
- an electronic book voice playback apparatus including: a content determining module, configured to determine an e-book to be played by voice according to a voice play instruction for instructing an e-book to perform voice play And an audio playing module, configured to obtain real vocal audio corresponding to the e-book content, and play the real vocal audio.
- a terminal device includes: a processor, a memory, a communication interface, and a communication bus, wherein the processor, the memory, and the communication interface are completed by using the communication bus Communication with each other; the memory is for storing at least one executable instruction that causes the processor to perform an operation corresponding to the e-book voice playback method as described above.
- the e-book voice playing solution provided by the embodiment of the invention can perform the voice playing of the corresponding e-book content through the voice playing instruction in the case of the user's eye fatigue or poor light, thereby realizing the "listening" of the e-book reading application. "Features. Moreover, in the embodiment of the present invention, real vocal audio is used, and compared with the machine-synthesized audio, the real vocal audio is far superior to the machine synthesis in terms of voice intonation and fluency because of recording through real vocals, so that the user Can get a better "listening" experience.
- FIG. 1 is a flow chart showing the steps of a method for playing an e-book voice according to a first embodiment of the present invention
- FIG. 2 is a flow chart showing the steps of a method for playing an e-book voice according to a second embodiment of the present invention
- FIG. 3 is a block diagram showing the structure of an electronic book voice playing device according to a third embodiment of the present invention.
- FIG. 4 is a block diagram showing the structure of an electronic book voice playback apparatus according to Embodiment 4 of the present invention.
- FIG. 5 is a schematic structural diagram of a terminal device according to Embodiment 5 of the present invention.
- FIG. 1 a flow chart of steps of an e-book voice playing method according to a first embodiment of the present invention is shown.
- Step S102 Determine an e-book content to be played by the voice according to a voice play instruction for instructing the e-book to perform voice play.
- the generation of the voice play instruction may be implemented in any suitable manner, including but not limited to: receiving the user's operation on the voice play button or option displayed in the e-book interface, or receiving the user's display of the e-book page.
- the setting operation (such as double-clicking, clicking, long-pressing) is generated, or is received after the user performs the voice playing setting through the corresponding setting menu, and the like, which is not limited by the embodiment of the present invention.
- the content of the e-book to be played by the voice may be the content set by the e-book reading application, such as the entire content of the currently displayed e-book, or one or more segments, one or more lines, one or more sentences selected by the user. And so on.
- Step S104 Obtain real vocal audio corresponding to the content of the e-book to be played by the voice, and play the real vocal audio.
- the real vocal audio corresponding to the content of the e-book can be obtained, and then played.
- the real vocal audio is the voice generated by the real person's voice, such as audio generated by a real person reading aloud, or audio generated by a real person's dialogue, or audio generated by processing a real human voice. (such as the audio generated by re-splitting and re-synthesizing sentences that have been read by real people) and so on.
- the user when the user is tired or the light is bad, the user can perform the voice playing of the corresponding e-book content through the voice playing instruction, thereby realizing the "listening" of the e-book reading application.
- real vocal audio is used, and compared with the machine-synthesized audio, the real vocal audio is far superior to the machine synthesis in terms of voice intonation and fluency because of recording through real vocals, so that the user Can get a better "listening" experience.
- the e-book voice playing method of this embodiment may be executed by any suitable device having data processing capability, including but not limited to: various terminal devices (including PCs, tablets, mobile terminals, etc.) and servers.
- FIG. 2 a flow chart of steps of an e-book voice playing method according to a second embodiment of the present invention is shown.
- Step S202 Determine the e-book content to be played by the voice according to the voice play instruction for performing voice play on the electronic book and the selection operation of the display content of the electronic book.
- the device where the e-book application is located receives the corresponding user.
- a corresponding voice play instruction is generated to indicate that the corresponding e-book content is played by voice.
- the content of the e-book to be played by the voice may be the content set by the e-book reading application by default, or may be the content selected by the user.
- an e-book voice play solution provided by an embodiment of the present invention is described by taking a user selection as an example.
- the user When the user selects the content of the e-book to be played by the voice, the user can select a certain segment or a certain segment of the content of the e-book, a certain line or a certain number of lines, the content of a certain sentence or a certain sentence, etc., by which the method can improve
- the flexibility of the user's "listening to the book” content enhances the user's "listening to the book” experience.
- the e-book content to be voice-played by default in the e-book reading application described in the first embodiment can also be applied to the solution of the embodiment.
- the operation of the user to indicate the voice play and the operation of the user to select the e-book content may be in any suitable order.
- the voice playback may be first indicated by an appropriate method, and then the e-book content may be selected; or the e-book content may be selected first, and then the selected e-book content may be voice-played.
- the latter embodiment is taken as an example to describe the solution of the embodiment of the present invention.
- those skilled in the art can implement the e-book voice playing solution based on the previous mode by referring to the embodiment.
- the selection operation of the display content of the e-book may be first received, and the e-book content to be played by the voice is determined according to the selection operation.
- a first operation of the display content of the electronic book by the user may be received, a first action point of the first operation in the display content is determined, and a second operation of the display content by the user is received, Determining a second action point of the second operation in the display content; determining display content between the first action point and the second action point as the e-book content to be played by the voice.
- the first operation and the second operation include, but are not limited to, a click operation.
- the user may receive a third operation of the display content of the electronic book, determine a third action point of the third operation in the display content, and use the third action point as a reference point, which will include
- the display content in the first setting range including the third action point is determined as the electronic book content to be played by the voice; or the display content in the second setting range starting from the third action point is determined as the to-be-voiced
- the content of the e-book to be played; or, the content of the third setting range ending with the third point of action is determined as the content of the e-book to be played by the voice.
- the first setting range, the second setting range, and the third setting range may be the same or different, and may be set by a person skilled in the art according to actual needs.
- the display content in the first setting range is determined as the content of the electronic book to be played by the voice, but is not limited thereto, and the third action point may not be the end point.
- the third operation includes, but is not limited to, a click operation. In this way, user operations are simplified and the operating burden of the system is reduced.
- the user may receive a selection operation of the display content of the electronic book, determine a content tag corresponding to the display content selected by the selection operation, and determine the content marked by the content tag as the to-be-voiced voice.
- a corresponding content mark is preset in the e-book content, and the content mark can be set by a person skilled in the art according to actual needs, such as setting a content mark for each chapter or each section, or setting one for each page.
- each segment is set to a content tag, or, based on an analysis of the e-book content, each complete episode (such as the teacher and student's dialogue in the classroom) or each complete scene (such as a sea scene) Set a content tag, and more.
- a selection operation for example, a certain portion of the e-book content is selected by the first operation and the second operation; or, a click operation is performed at any position of the currently displayed e-book content, such as The third operation mode; or, when the content tag is displayed to the user in an appropriate prompt manner in the e-book, after the user operates the corresponding prompt, the e-book reading application first determines the corresponding content tag, and further, the content is The entire portion of the e-book content marked by the tag is determined as the e-book content to be played by the voice.
- the method is not limited to the above manner.
- other suitable manners for determining the content of the e-book to be played by the voice are also applicable to the solution of the embodiment of the present invention, such as determining the content of the entire page currently displayed by the e-book as the to-be-voiced voice.
- Step S204 Obtain real vocal audio corresponding to the content of the e-book to be played by the voice, and play the real vocal audio.
- the real vocal audio includes at least one of the following: a film and television audio obtained from a movie drama corresponding to the electronic book; a spoken audio corresponding to the electronic book content of the electronic book; and a user recording of the electronic book reading application where the electronic book is located User audio.
- the electronic book "Three Kingdoms" corresponds to the original sound of the original voice, and in this case, the start position of the audio corresponding to the content of the electronic book to be played by the voice can be determined, and the play is performed from the home position.
- the user of the e-book reading application reads all or part of the content of the e-book and records it into audio, or combines the e-book content for voice commenting and saving it as audio, in the case where the audio can be used, such as
- the audio is set by the user to be shared, or sent to others, or published in an appropriate way in an e-book reading application, such as by e-book comment posting or by sharing or by other appropriate means, etc.
- the audio can be used to implement the "listening".
- the user before determining the e-book content to be played by the voice according to the voice play instruction of the e-book, the user can also receive the spoken audio recorded by the user through the e-book reading application for the content of the e-book, and the recorded audio and The content of the corresponding e-book is stored in association; and/or, the user receives the comment audio recorded by the e-book reading application for the content of the e-book, and associates the comment audio with the content of the corresponding e-book.
- the "listening" function is realized based on the recorded audio of the user recorded and associated storage, further enhancing the user's experience of using the e-book reading application.
- the above-mentioned real vocal audio can be further processed, such as splitting and re-synthesizing to meet the real vocal audio playing needs in certain situations, such as video
- the real vocal audio can also be synthesized with the background audio and/or the business audio to generate synthesized audio, in which case the synthesized audio corresponding to the electronic book content to be played by the voice will be obtained, wherein the synthesized audio includes In addition to the real vocal audio, background audio and/or service audio is also included; and the synthesized audio is played.
- the background audio can be background music, and the background audio can further enhance the atmosphere, so that the user can feel the atmosphere of the part of the e-book content;
- the service audio can be the business audio recorded by the person in the current real vocal audio, or It is a business audio related to the content of the e-book to be played by voice, such as a story-related business audio.
- the business audio can be inserted at any appropriate position at the beginning, end, or beginning to end of the current real vocal audio.
- the business audio can be implemented as an advertising audio.
- the content tag may be pre-set for the e-book, and the audio tag may be pre-set for the real vocal audio. That is, at least one content tag for marking the content of the e-book is pre-set in the e-book, and at least one audio tag for marking the audio content is pre-set in the real vocal audio, based on which the content tag can be Correspondence between audio tags, obtaining real vocal audio corresponding to the contents of the e-book.
- a content tag corresponding to the e-book content to be played by the voice may be determined; and an audio tag corresponding to the content tag is determined according to the correspondence between the pre-stored content tag and the audio tag; and the determined audio tag is obtained Corresponding audio content.
- the content mark and audio mark the real vocal audio corresponding to the e-book content can be obtained quickly and accurately, and the response speed of the "listening" function to the user operation is improved.
- the e-book to be played by the voice may be determined in advance (for example, in a voice play instruction for performing voice play on the e-book according to the indication) Before the step of content) performing voice recognition on the existing or acquired real vocal audio, obtaining the corresponding text content; determining the e-book content in the e-book that matches the text content; establishing and storing the real content corresponding to the text content The correspondence between the vocal audio and the determined e-book content.
- voice recognition is performed on a piece of video audio of a period of 30 minutes, and corresponding multi-segment text content is obtained; further, the multi-segment text content is respectively matched with the e-book content, and the multi-segment text content and the e-book are determined according to the matching result.
- Corresponding relationship between multiple pieces of content; further, according to the relationship between the two the correspondence between the plurality of parts of the real vocal audio corresponding to the plurality of pieces of text content recognized by the speech and the contents of the plurality of pieces of e-book contents can be established and stored relationship. Based on this, when the real vocal audio corresponding to the e-book content to be played by the voice is obtained, the real vocal audio corresponding to the e-book content to be played by the voice can be obtained according to the correspondence.
- real vocal audio includes a plurality of, for example, at least two of the audio-visual line audio, the e-book content reading audio, and the user audio
- real vocal audio corresponding to the e-book content can be obtained from at least two of the audio and video audio, the e-book content reading audio, and the user audio according to a preset priority; or Receiving, by the user, a selection operation of at least two corresponding options of the audio-visual line audio, the e-book content reading audio, and the user audio, and obtaining the real vocal audio corresponding to the e-book content selected by the selecting operation; or
- the user may also determine the audio type preference of the user according to the historical data of the user playing the real vocal audio; according to the user's audio type preference, at least two of the audio and video audio, the e-book content reading audio, and the user audio are obtained and to be obtained.
- the real vocal audio corresponding to the e-book content of the voice playback For example, the user's historical data indicates that the user has had ten voice playback records. Among them, the audio and video audio is used eight times. When the user plays the voice again, the audio and video audio can be directly used to perform the corresponding e-book content. Voice playback.
- the priority of setting three types of audio such as audio and video audio, e-book content reading audio, and user audio is from high to low: user audio, audio and video audio, and e-book content reading audio.
- the user audio is played; and if a part of the text of the e-book only corresponds to some of the audio, for example, the audio and electronic contents of the electronic book and the e-book content are read aloud.
- Audio will play the film and television audio, and if the part of the text only corresponds to the e-book content reading audio, the e-book content will be played aloud audio.
- priority setting is only an exemplary description, and may be appropriately set by a person skilled in the art according to actual needs, which is not limited by the embodiment of the present invention. By setting the priority, it is possible to ensure, as much as possible, that the e-book text corresponds to audio, and the form of the audio is diversified.
- the user is provided with greater flexibility in selecting the real vocal audio corresponding to the e-book content, and the user can select the audio and play it.
- the options corresponding to the audio and video audio, the e-book content reading audio, and the user audio may be appropriately set by a person skilled in the art according to actual needs.
- the audio and video audio may be displayed through a pop-up window or a transparent overlay.
- the e-book content reads the audio and user audio options.
- the e-book application After receiving a voice play instruction for performing voice playback on a part of the e-book content, the e-book application presents a corresponding audio option to the user through a pop-up window or a transparent overlay layer for the user to select, and after playing the user's selection result, playing The real vocal audio corresponding to the selection result, for example, if the user selects the film and television word audio, the audio and video audio corresponding to the part of the electronic book content is played. Based on the interface for displaying the content of the e-book, the audio option is displayed through the pop-up window or the transparent overlay layer, which facilitates the user's operation and improves the user experience.
- step S206 or step S208 can be further performed.
- Step S206 in the process of playing the real vocal audio, receiving the page turning operation of the e-book, suspending the playing of the real vocal audio; re-determining the e-book content to be played by the voice according to the page turning operation; obtaining and re-creating The actual vocal audio corresponding to the determined e-book content is played and played.
- the audio has not been played yet, and the user has performed corresponding operations, such as page turning or page turning, and the e-book reading application is monitored during the audio playback process.
- the playing of the audio is automatically suspended; further, the e-book content to be played by the voice is re-determined according to the page turning operation, for example, determining the final target page of the page turning operation, and then re-creating according to the content of the target page. Determine the content of the e-book to be played.
- the current real vocal audio is playing the content of the first sentence of the third paragraph of the fifth page of the e-book.
- the user performs a continuous page turning operation, and finally stops at the e-book.
- Page 10 of the page in this case, you can stop the previous audio and play the real vocal audio of the e-book content on page 10 (such as the audio corresponding to the content tag of the first e-book on page 10, or , the audio corresponding to the start text on page 10, or the audio of the scene on page 10 or the scene, etc.); it is also possible to stop the previous audio and receive the user's selection of the e-book content on page 10, The real vocal audio corresponding to the e-book content selected by the selection operation is played.
- the page turning operation is similar to the page turning operation, and will not be described here.
- the previous audio may be stopped, and the real vocal audio corresponding to the e-book content of the fifth page may be re-determined, for example, the audio corresponding to the content mark of the first e-book of the fifth page, or, page 5
- the initial text corresponds to the audio, or the episode on page 5 or the audio corresponding to the scene, and so on.
- the way of continuing the playback of the real vocal audio before the interruption is closer to the real needs of the user "listening to the book” than the other methods, and improving the user's "listening to the book” experience.
- Step S208 In the process of playing the real vocal audio, receiving an audio processing instruction for the played real vocal audio, and performing an operation indicated by the audio processing instruction on the real vocal audio.
- the audio processing instruction includes, but is not limited to, at least one of: a pause instruction for instructing suspension of real human voice audio playback, a first adjustment instruction for indicating a playback speed of adjusting real human voice audio, and an instruction for adjusting the real person.
- a second adjustment instruction of the playback progress of the audio and audio an exit instruction for instructing the exit of the real human voice audio, and a switching instruction for indicating the type of switching the real human voice audio.
- the user may send a pause instruction to the e-book reading application by operating the “pause” or the similar operation option to pause the playing of the current audio; or, when it is detected that the user interrupts the e-book reading application and uses other applications, the e-book reading application can automatically generate a corresponding pause instruction to suspend the playback of the current audio.
- the user may send an exit instruction indicating that the real human voice audio is exited to the e-book reading application by operating a “stop” or the like operation option to stop the playing of the current real human voice audio.
- the user may perform a selection operation on other audio types displayed, or by " A switch vocal" or similar operation option sends a switch instruction to the e-book reading application indicating the type of switching real vocal audio.
- the current real vocal audio is user audio
- the user selects one of a plurality of displayed audio types by the operation of the “switch vocal” operation option, for example, switching the user audio to the audio and video audio or electronic The contents of the book read the audio.
- the first adjusting instruction for adjusting the playing speed of the real vocal audio can be sent to the e-book reading application through the corresponding playing speed adjusting operation option to adjust the playing speed of the current audio.
- the playback speed of the current real vocal audio will be adjusted to 2 times of the original playback speed.
- the second adjustment instruction indicating that the playback progress of the real vocal audio is adjusted may be sent to the e-book reading application through the corresponding play progress adjustment operation option.
- the user can adjust the playing progress of the current real vocal audio by clicking the “fast forward” or similar operation option, or by dragging the audio playback progress bar.
- the foregoing audio processing instructions may be implemented by any suitable setting by those skilled in the art.
- the audio processing instructions may be displayed by a floating icon or a floating window or a transparent overlay.
- the user when the user is tired or the light is bad, the user can perform the voice playing of the corresponding e-book content through the voice playing instruction, thereby realizing the "listening" of the e-book reading application.
- real vocal audio is used, and compared with the machine-synthesized audio, the real vocal audio is far superior to the machine synthesis in terms of voice intonation and fluency because of recording through real vocals, so that the user Can get a better "listening" experience.
- the e-book voice playing method of this embodiment may be executed by any suitable device having data processing capability, including but not limited to: various terminal devices (including PCs, tablets, mobile terminals, etc.) and servers.
- FIG. 3 a block diagram of a structure of an electronic book voice playback apparatus according to a third embodiment of the present invention is shown.
- the e-book voice playing device of the embodiment includes: a content determining module 302, configured to determine an e-book content to be played by the voice according to a voice playing instruction for instructing the e-book to perform voice playing; and an audio playing module 304, configured to obtain and The e-book content corresponds to real vocal audio and plays the real vocal audio.
- the user when the user is tired or the light is bad, the user can perform the voice playing of the corresponding e-book content through the voice playing instruction, thereby realizing the "listening" of the e-book reading application.
- real vocal audio is used, and compared with the machine-synthesized audio, the real vocal audio is far superior to the machine synthesis in terms of voice intonation and fluency because of recording through real vocals, so that the user Can get a better "listening" experience.
- FIG. 4 a block diagram of a structure of an electronic book voice playback apparatus according to a fourth embodiment of the present invention is shown.
- the e-book voice playing device of the embodiment includes: a content determining module 402, configured to determine an e-book content to be played by the voice according to a voice playing instruction for instructing the e-book to perform voice playing; and an audio playing module 404, configured to obtain The e-book content corresponds to real vocal audio and plays the real vocal audio.
- the real vocal audio includes at least one of: a film and television audio obtained from a movie drama corresponding to the electronic book; a reading audio corresponding to the electronic book content of the electronic book; and an e-book reading application where the electronic book is located User audio recorded by the user.
- the audio playing module 404 is configured to obtain synthesized audio corresponding to the electronic book content to be played, wherein the synthesized audio includes background audio and/or service audio in addition to the real human voice audio; and is used for playing The synthesized audio.
- At least one content tag for marking the content of the e-book is pre-set in the e-book, and at least one audio tag for marking the audio content is pre-set in the real vocal audio;
- the audio playing module 404 is configured to Marking a correspondence relationship with the audio mark, obtaining real vocal audio corresponding to the electronic book content, and playing the real vocal audio.
- the audio play module 404 is configured to determine a content mark corresponding to the electronic book content to be played by the voice; determine an audio mark corresponding to the content mark according to the corresponding relationship between the pre-stored content mark and the audio mark; acquire and determine The audio tag corresponds to the audio content and plays the audio content.
- the e-book voice playing device of the embodiment further includes: a relationship establishing module 406, configured to determine, according to the voice playing instruction for instructing the e-book to perform voice playing, the content of the e-book to be played by the voice Previously, speech recognition is performed on real vocal audio to obtain corresponding text content; e-book content matching the text content in the e-book is determined; real vocal audio corresponding to the text content and determined electronic are established and stored Corresponding relationship between the contents of the book; the audio playing module 404 is configured to obtain, according to the correspondence between the real vocal audio corresponding to the text content and the determined content of the electronic book, corresponding to the content of the electronic book to be played by the voice Real vocal audio and play the real vocal audio.
- a relationship establishing module 406 configured to determine, according to the voice playing instruction for instructing the e-book to perform voice playing, the content of the e-book to be played by the voice Previously, speech recognition is performed on real vocal audio to obtain corresponding text content; e-book content matching the
- the audio playing module 404 is configured to use the audio and television content and the e-book content according to the preset priority. Reading at least two of the audio and the user audio, obtaining real vocal audio corresponding to the e-book content, and playing the real vocal audio; or, the audio playing module 404 is configured to receive the user's audio and video a book content reading audio, and a selection operation of at least two corresponding options in the user audio, obtaining real vocal audio selected by the selection operation corresponding to the e-book content, and playing the real vocal Audio; or, the audio playing module 404 is configured to determine the user's audio type preference according to the historical data of the user playing the real vocal audio; according to the user's audio type preference, from the audio and video audio, the e-book content, the audio and the user audio In at least two, real vocal audio corresponding to the content of the e-book to be played by the voice is
- the electronic book voice playing device of the embodiment further includes: a display module 408, configured to receive, by the audio playing module 404, at least two corresponding options of the user for the audio and video audio, the electronic book content reading audio, and the user audio. Before the selection operation, at least two corresponding options of the audio and video audio, the e-book content reading audio, and the user audio are displayed through a pop-up window or a transparent overlay.
- the content determining module 402 is configured to determine, according to the voice play instruction for performing voice play on the electronic book and the selection operation of the display content of the electronic book, the electronic book content to be played by the voice.
- the electronic book voice playing device of the embodiment further includes: a content selection module 410, configured to select, according to the voice playing instruction for instructing the electronic book to perform voice playing, and the selection content of the electronic book in the content determining module 402 The operation, before determining the content of the electronic book to be played by the voice, receives a selection operation of the display content of the electronic book, and determines the content of the electronic book to be played by the voice according to the selection operation.
- a content selection module 410 configured to select, according to the voice playing instruction for instructing the electronic book to perform voice playing, and the selection content of the electronic book in the content determining module 402 The operation, before determining the content of the electronic book to be played by the voice, receives a selection operation of the display content of the electronic book, and determines the content of the electronic book to be played by the voice according to the selection operation.
- the content selection module 410 includes: a first selection module 4102, configured to receive a first operation of the display content of the electronic book by the user, determine a first action point of the first operation in the display content, and receive the display content of the user The second operation determines a second action point of the second operation in the display content; determining display content between the first action point and the second action point as the e-book content to be played by the voice.
- a first selection module 4102 configured to receive a first operation of the display content of the electronic book by the user, determine a first action point of the first operation in the display content, and receive the display content of the user The second operation determines a second action point of the second operation in the display content; determining display content between the first action point and the second action point as the e-book content to be played by the voice.
- the content selection module 410 includes: a second selection module 4104, configured to receive a third operation of the display content of the electronic book by the user, determine a third action point of the third operation in the display content;
- the action point is a reference point, and the display content in the first setting range including the third action point is determined as the content of the e-book to be played by the voice; or the second setting range starting from the third action point
- the display content inside is determined as the e-book content to be played by the voice; or the display content in the third setting range ending with the third action point is determined as the e-book content to be played by the voice.
- the content selection module 410 includes: a third selection module 4106, configured to receive a user's selection operation on the display content of the electronic book, determine a content tag corresponding to the display content selected by the selection operation, and mark the content The marked content is determined as the e-book content to be played by the voice.
- a third selection module 4106 configured to receive a user's selection operation on the display content of the electronic book, determine a content tag corresponding to the display content selected by the selection operation, and mark the content The marked content is determined as the e-book content to be played by the voice.
- the e-book voice playing device of the embodiment further includes: a recording storage module 412, configured to determine, according to the voice playing instruction for instructing the e-book to perform voice playing, the content determining module 402 to determine the e-book content to be played by the voice Previously, receiving the spoken audio recorded by the user through the e-book reading application for the content of the e-book, associating the recorded audio with the content of the corresponding e-book; and/or receiving the user recording the content of the e-book through the e-book reading application
- the comment audio stores the comment audio associated with the content of the corresponding e-book.
- the e-book voice playback device of this embodiment further includes: an audio processing module 414, configured to receive an audio processing instruction for the played real vocal audio, and perform the audio processing instruction on the real vocal audio Indicated action.
- an audio processing module 414 configured to receive an audio processing instruction for the played real vocal audio, and perform the audio processing instruction on the real vocal audio Indicated action.
- the audio processing instruction includes at least one of: a pause instruction for instructing suspension of the real human voice audio playback, a first adjustment instruction for indicating a playback speed of the real human voice audio, And a second adjustment instruction indicating that the playback progress of the real vocal audio is adjusted, an exit instruction for instructing to exit the real vocal audio play, and a switching instruction for indicating a type of switching the real vocal audio.
- the display module 408 is further configured to display the audio processing instruction by using a floating icon or a floating window or a transparent overlay.
- the e-book voice playback device of the embodiment further includes: a re-determination module 416, configured to receive a page turning operation on the e-book during the process of playing the real vocal audio, and suspend the real vocal audio Playback; re-determine the e-book content to be played by the voice according to the page turning operation; obtain real vocal audio corresponding to the re-determined e-book content and play.
- a re-determination module 416 configured to receive a page turning operation on the e-book during the process of playing the real vocal audio, and suspend the real vocal audio Playback; re-determine the e-book content to be played by the voice according to the page turning operation; obtain real vocal audio corresponding to the re-determined e-book content and play.
- the e-book voice playback device of the present embodiment is used to implement the corresponding e-book voice playback method in the foregoing multiple method embodiments, and has the beneficial effects of the corresponding method embodiments, and details are not described herein again.
- FIG. 5 a schematic structural diagram of a terminal device according to Embodiment 5 of the present invention is shown.
- the specific implementation of the present invention does not limit the specific implementation of the terminal device.
- the terminal device may include a processor 502, a communications interface 504, a memory 506, and a communication bus 508.
- Processor 502, communication interface 504, and memory 506 complete communication with one another via communication bus 508.
- the communication interface 504 is configured to communicate with network elements of other devices, such as other terminal devices or servers.
- the processor 502 is configured to execute the program 510, and specifically, the related steps in the foregoing embodiment of the electronic book voice playing method.
- program 510 can include program code, the program code including computer operating instructions.
- the processor 502 may be a central processing unit CPU, or an Application Specific Integrated Circuit (ASIC), or one or more integrated circuits configured to implement embodiments of the present invention.
- the one or more processors included in the terminal device may be the same type of processor, such as one or more CPUs; or may be different types of processors, such as one or more CPUs and one or more ASICs.
- the memory 506 is configured to store the program 510.
- Memory 506 may include high speed RAM memory and may also include non-volatile memory, such as at least one disk memory.
- the program 510 may be specifically configured to cause the processor 502 to: determine the e-book content to be played by the voice according to the voice play instruction indicating the voice play of the e-book; and obtain the real vocal audio corresponding to the e-book content. And play the real vocal audio.
- the real vocal audio includes at least one of: audio and video audio obtained from a movie drama corresponding to the electronic book; reading audio corresponding to the electronic book content of the electronic book; User audio recorded by the user of the e-book reading application.
- the program 510 is further configured to enable the processor 502 to obtain and play the real vocal audio corresponding to the e-book content to be played, and play the real vocal audio.
- At least one content tag for marking the content of the e-book is pre-set in the e-book, and at least one audio tag for marking the audio content is pre-set in the real vocal audio; the program 510 is also used to When the processor 502 obtains the real vocal audio corresponding to the e-book content, according to the correspondence between the content tag and the audio tag, obtaining a real vocal corresponding to the e-book content Audio.
- the program 510 is further configured to enable the processor 502 to obtain real vocal audio corresponding to the e-book content according to the correspondence between the content tag and the audio tag. Determining a content tag corresponding to the content of the e-book to be played by the voice; determining an audio tag corresponding to the content tag according to the correspondence between the pre-stored content tag and the audio tag; acquiring the audio tag corresponding to the determined Audio content.
- the program 510 is further configured to cause the processor 502 to perform voice on the real vocal audio before determining the e-book content to be played by the voice according to the voice play instruction for performing the voice play on the electronic book according to the indication. Identifying, obtaining corresponding text content; determining e-book content in the e-book that matches the text content; establishing and storing between the real vocal audio corresponding to the text content and the determined content of the e-book Corresponding relationship; the program 510 is further configured to: when the processor 502 obtains the real vocal audio corresponding to the e-book content to be played, obtain the real vocal corresponding to the e-book content to be played according to the correspondence relationship Audio.
- the program 510 is further configured to cause the processor 502 to obtain and When the real vocal audio corresponding to the e-book content corresponds to at least two of the audio-visual line audio, the e-book content reading audio, and the user audio, the content corresponding to the e-book content is obtained according to a preset priority.
- Real vocal audio or, receiving a user's selection operation of at least two corresponding options of the audio-visual line audio, the e-book content reading audio, and the user audio, and obtaining the e-book content selected by the selection operation
- Corresponding real vocal audio or, according to the historical data of the user playing real vocal audio, determining the user's audio type preference; according to the user's audio type preference, from the audio and video audio, the e-book content reading audio, and the user audio In at least two, real vocal audio corresponding to the content of the e-book to be played is obtained.
- the program 510 is further configured to cause the processor 502 to pass the user's selection operation of the at least two corresponding options of the station audio, the e-book content reading audio, and the user audio.
- the pop-up window or transparent overlay displays options corresponding to at least two of the audio and video audio, the e-book content reading audio, and the user audio.
- the program 510 is further configured to: when the processor 502 determines the e-book content to be played by the voice according to the voice play instruction for performing the voice play on the electronic book according to the instruction, perform voice on the e-book according to the indication.
- the played voice play command and the selection operation of the display content of the e-book determine the content of the e-book to be played by the voice.
- the program 510 is further configured to: determine, by the processor 502, a voice play instruction for performing voice play on the electronic book according to the indication and a selection operation on the display content of the electronic book, and determine an electronic book to be played by the voice. Before the content, a selection operation of the display content of the electronic book is received, and the electronic book content to be played by the voice is determined according to the selection operation.
- the program 510 is further configured to: when the processor 502 receives the selection operation of the display content of the electronic book, and determines the electronic book content to be played by the voice according to the selecting operation, receiving the user to the electronic a first operation of displaying content of the book, determining a first action point of the first operation in the display content; receiving a second operation of the display content by the user, determining a second operation of the second operation in the display content The action point; the display content between the first action point and the second action point is determined as the content of the e-book to be played by the voice.
- the program 510 is further configured to: when the processor 502 receives the selection operation of the display content of the electronic book, and determines the electronic book content to be played by the voice according to the selecting operation, receiving the user to the electronic a third operation of displaying the content of the book, determining a third action point of the third operation in the display content; using the third action point as a reference point, the first set range including the third action point
- the display content is determined as the e-book content to be played by the voice; or the display content in the second setting range starting from the third action point is determined as the e-book content to be played by the voice; or, the third action point is to be
- the display content in the third setting range of the end point is determined as the content of the e-book to be played by the voice.
- the program 510 is further configured to: when the processor 502 receives the selection operation of the display content of the electronic book, and determines the electronic book content to be played by the voice according to the selecting operation, receiving the user to the electronic a selection operation of the display content of the book, determining a content tag corresponding to the display content selected by the selection operation; and determining the content marked by the content tag as the e-book content to be played by the voice.
- the program 510 is further configured to: when the processor 502 determines the e-book content to be played by the voice according to the voice play instruction of the e-book, receive the content that the user reads the application into the e-book through the e-book. Recording aloud audio, storing the recorded audio in association with the content of the corresponding e-book; and/or receiving the comment audio recorded by the user through the e-book reading application for the content of the e-book, and the content of the comment audio and the corresponding e-book Associate storage.
- the program 510 is further configured to cause the processor 502 to receive an audio processing instruction for the played real vocal audio, the real vocal audio being subjected to the operation indicated by the audio processing instruction.
- the audio processing instruction includes at least one of: a pause instruction for instructing suspension of real human voice audio playback, a first adjustment instruction for indicating a playback speed of adjusting real human voice audio, a second adjustment instruction for instructing adjustment of the playback progress of the real vocal audio, an exit instruction for indicating the exit of the real vocal audio playback, and a switching instruction for indicating the type of switching the real vocal audio.
- the program 510 is further configured to cause the processor 502 to display the audio processing instructions via a floating icon or a floating window or a transparent overlay.
- the program 510 is further configured to enable the processor 502 to receive a page turning operation on the e-book during the playing of the real human voice audio, and suspend the playing of the real human voice audio;
- the page turning operation redetermines the content of the e-book to be played by the voice; the real vocal audio corresponding to the content of the re-determined e-book is obtained and played.
- the voice play of the corresponding e-book content can be performed by the voice play instruction, and the “listening to book” function of the e-book reading application is realized.
- real vocal audio is used, and compared with the machine-synthesized audio, the real vocal audio is far superior to the machine synthesis in terms of voice intonation and fluency because of recording through real vocals, so that the user Can get a better "listening" experience.
- the above method according to an embodiment of the present invention may be implemented in hardware, firmware, or implemented as software or computer code that may be stored in a recording medium such as a CD ROM, a RAM, a floppy disk, a hard disk, or a magneto-optical disk, or implemented by
- the network downloads computer code originally stored in a remote recording medium or non-transitory machine readable medium and stored in a local recording medium so that the methods described herein can be stored using a general purpose computer, a dedicated processor or programmable
- Such software processing on a recording medium of dedicated hardware such as an ASIC or an FPGA.
- a computer, processor, microprocessor controller or programmable hardware includes storage components (eg, RAM, ROM, flash memory, etc.) that can store or receive software or computer code, when the software or computer code is The e-book voice playback method described herein is implemented when the processor or hardware accesses and executes. Moreover, when a general purpose computer accesses code for implementing the e-book voice playback method shown herein, execution of the code converts the general purpose computer into a special purpose computer for executing the electronic book voice playback method shown herein.
- storage components eg, RAM, ROM, flash memory, etc.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- User Interface Of Digital Computer (AREA)
- Electrically Operated Instructional Devices (AREA)
Abstract
Provided are an electronic-book voice playback method, apparatus, and terminal device; according to a voice playback instruction used for instructing an electronic book to perform voice playback, content of a to-be-played electronic book is determined (S102); real human voice audio of the content corresponding to the to-be-played electronic book is obtained, and the real human voice audio is played (S104). Thus the user is provided with a better "book listening" experience.
Description
交互参考Cross reference
本申请要求以下优先权:2017年7月21日提出的申请号:201710601433.6,名称:“电子书语音播放方法、装置及终端设备”的中国专利,本申请参考引用了如上所述申请的全部内容。The present application claims the following priority: Application No.: 201710601433.6, entitled "E-book Voice Play Method, Apparatus, and Terminal Equipment", filed on July 21, 2017, the entire contents of which are hereby incorporated by reference. .
本发明实施例涉及电子书数据处理技术领域,尤其涉及一种电子书语音播放方法、装置及终端设备。Embodiments of the present invention relate to the field of electronic book data processing technologies, and in particular, to an electronic book voice playing method, apparatus, and terminal device.
电子书是利用计算机技术将文字、图片、声音、影像等信息内容数字化的出版物。随着互联网技术应用的越来越广泛,传统的纸质阅读方式已逐渐被电子书取代,人们越来越趋向于利用互联网和计算机技术,通过用于阅读电子书的电子书阅读应用下载电子书进行阅读。An e-book is a publication that digitizes information such as text, pictures, sounds, and images using computer technology. With the increasing use of Internet technology, traditional paper reading methods have gradually been replaced by e-books. People are increasingly using Internet and computer technology to download e-books through e-book reading applications for reading e-books. Read it.
但随着智能终端技术的发展,人们对电子书阅读应用的要求也越来越高,比如,如何在眼睛疲劳或者光线不好的情况下也可以阅读自己感兴趣的电子书。因此,如何满足用户的这一需求就成为亟待解决的问题。However, with the development of smart terminal technology, people have higher and higher requirements for e-book reading applications. For example, how to read e-books of interest in the case of eye fatigue or poor light. Therefore, how to meet the needs of users has become an urgent problem to be solved.
发明内容Summary of the invention
有鉴于此,本发明实施例提供了一种电子书语音播放方法、装置及终端设备,以解决用户在眼睛疲劳或者光线不好的情况下阅读电子书的问题。In view of this, the embodiments of the present invention provide a method, a device, and a terminal device for playing an e-book voice, so as to solve the problem that the user reads the e-book under the condition of eye fatigue or poor light.
根据本发明实施例的一个方面,提供了一种电子书语音播放方法,包括:根据用于指示电子书进行语音播放的语音播放指令,确定待语音播放的电子书内容;获得与所述电子书内容相对应的真实人声音频,并播放所述真实人声音频。According to an aspect of an embodiment of the present invention, a method for playing an e-book voice includes: determining an e-book content to be played by a voice according to a voice play instruction for instructing an e-book to perform voice playback; obtaining the e-book The content corresponds to the real vocal audio and plays the real vocal audio.
根据本发明实施例的另一个方面,还提供了一种电子书语音播放装 置,包括:内容确定模块,用于根据用于指示电子书进行语音播放的语音播放指令,确定待语音播放的电子书内容;音频播放模块,用于获得与所述电子书内容相对应的真实人声音频,并播放所述真实人声音频。According to another aspect of the embodiments of the present invention, an electronic book voice playback apparatus is provided, including: a content determining module, configured to determine an e-book to be played by voice according to a voice play instruction for instructing an e-book to perform voice play And an audio playing module, configured to obtain real vocal audio corresponding to the e-book content, and play the real vocal audio.
根据本发明实施例的再一个方面,还提供了一种终端设备,包括:处理器、存储器、通信接口和通信总线,所述处理器、所述存储器和所述通信接口通过所述通信总线完成相互间的通信;所述存储器用于存放至少一可执行指令,所述可执行指令使所述处理器执行如上所述的电子书语音播放方法对应的操作。According to still another aspect of the embodiments of the present invention, a terminal device includes: a processor, a memory, a communication interface, and a communication bus, wherein the processor, the memory, and the communication interface are completed by using the communication bus Communication with each other; the memory is for storing at least one executable instruction that causes the processor to perform an operation corresponding to the e-book voice playback method as described above.
通过本发明实施例提供的电子书语音播放方案,在用户在眼睛疲劳或者光线不好的情况下,可以通过语音播放指令进行相应电子书内容的语音播放,实现了电子书阅读应用的“听书”功能。并且,本发明实施例中,使用真实人声音频,相比较于机器合成的音频,真实人声音频因为通过真实人声录制,其在语音语调以及流畅性方面都远优于机器合成,使得用户能够获得较好的“听书”体验。The e-book voice playing solution provided by the embodiment of the invention can perform the voice playing of the corresponding e-book content through the voice playing instruction in the case of the user's eye fatigue or poor light, thereby realizing the "listening" of the e-book reading application. "Features. Moreover, in the embodiment of the present invention, real vocal audio is used, and compared with the machine-synthesized audio, the real vocal audio is far superior to the machine synthesis in terms of voice intonation and fluency because of recording through real vocals, so that the user Can get a better "listening" experience.
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明实施例中记载的一些实施例,对于本领域普通技术人员来讲,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the embodiments or the description of the prior art will be briefly described below. Obviously, the drawings in the following description are only It is a few embodiments described in the embodiments of the present invention, and other drawings can be obtained according to the drawings for those skilled in the art.
图1是根据本发明实施例一的一种电子书语音播放方法的步骤流程图;1 is a flow chart showing the steps of a method for playing an e-book voice according to a first embodiment of the present invention;
图2是根据本发明实施例二的一种电子书语音播放方法的步骤流程图;2 is a flow chart showing the steps of a method for playing an e-book voice according to a second embodiment of the present invention;
图3是根据本发明实施例三的一种电子书语音播放装置的结构框图;3 is a block diagram showing the structure of an electronic book voice playing device according to a third embodiment of the present invention;
图4是根据本发明实施例四的一种电子书语音播放装置的结构框图;4 is a block diagram showing the structure of an electronic book voice playback apparatus according to Embodiment 4 of the present invention;
图5是根据本发明实施例五的一种终端设备的结构示意图。FIG. 5 is a schematic structural diagram of a terminal device according to Embodiment 5 of the present invention.
当然,实施本发明实施例的任一技术方案必不一定需要同时达到以上的所有优点。Of course, any technical solution of implementing the embodiments of the present invention necessarily does not necessarily need to achieve all the above advantages at the same time.
为了使本领域的人员更好地理解本发明实施例中的技术方案,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅是本发明实施例一部分实施例,而不是全部的实施例。基于本发明实施例中的实施例,本领域普通技术人员所获得的所有其他实施例,都应当属于本发明实施例保护的范围。For a better understanding of the technical solutions in the embodiments of the present invention, the technical solutions in the embodiments of the present invention will be clearly and completely described in conjunction with the accompanying drawings in the embodiments of the present invention. The embodiments are only a part of the embodiments of the embodiments of the invention, and not all of the embodiments. All other embodiments obtained by those skilled in the art should be within the scope of protection of the embodiments of the present invention based on the embodiments in the embodiments of the present invention.
实施例一Embodiment 1
参照图1,示出了根据本发明实施例一的一种电子书语音播放方法的步骤流程图。Referring to FIG. 1, a flow chart of steps of an e-book voice playing method according to a first embodiment of the present invention is shown.
本实施例的电子书语音播放方法包括以下步骤:The e-book voice playing method of this embodiment includes the following steps:
步骤S102:根据用于指示电子书进行语音播放的语音播放指令,确定待语音播放的电子书内容。Step S102: Determine an e-book content to be played by the voice according to a voice play instruction for instructing the e-book to perform voice play.
其中,语音播放指令的生成可以以任意适当方式实现,包括但不限于:接收到用户对电子书界面中显示的语音播放按钮或选项的操作后生成,或者,接收到用户对显示的电子书页面的设定操作(如双击、单击、长按)后生成,或者,接收到用户通过相应的设置菜单进行语音播放设置后生成,等等,本发明实施例对此不作限制。The generation of the voice play instruction may be implemented in any suitable manner, including but not limited to: receiving the user's operation on the voice play button or option displayed in the e-book interface, or receiving the user's display of the e-book page. The setting operation (such as double-clicking, clicking, long-pressing) is generated, or is received after the user performs the voice playing setting through the corresponding setting menu, and the like, which is not limited by the embodiment of the present invention.
待语音播放的电子书内容可以是电子书阅读应用默认设置的内容,如当前显示的某一页电子书的全部内容,也可以是用户选择的一段或多段、一行或多行、一句或多句等内容。The content of the e-book to be played by the voice may be the content set by the e-book reading application, such as the entire content of the currently displayed e-book, or one or more segments, one or more lines, one or more sentences selected by the user. And so on.
步骤S104:获得与待语音播放的电子书内容相对应的真实人声音频,并播放该真实人声音频。Step S104: Obtain real vocal audio corresponding to the content of the e-book to be played by the voice, and play the real vocal audio.
在确定了待语音播放的电子书内容后,即可获得该电子书内容所对应的真实人声音频,进而进行播放。After the content of the e-book to be played by the voice is determined, the real vocal audio corresponding to the content of the e-book can be obtained, and then played.
其中,真实人声音频是真实的人的语音生成的音频,如,由真实的人朗读生成的音频、或由真实的人的对白生成的音频、或对真实的人声进行 处理后生成的音频(如对真实的人朗读过的句子进行重新拆分再合成等处理后生成的音频)等等。Among them, the real vocal audio is the voice generated by the real person's voice, such as audio generated by a real person reading aloud, or audio generated by a real person's dialogue, or audio generated by processing a real human voice. (such as the audio generated by re-splitting and re-synthesizing sentences that have been read by real people) and so on.
通过本实施例提供的电子书语音播放方案,在用户在眼睛疲劳或者光线不好的情况下,可以通过语音播放指令进行相应电子书内容的语音播放,实现了电子书阅读应用的“听书”功能。并且,本发明实施例中,使用真实人声音频,相比较于机器合成的音频,真实人声音频因为通过真实人声录制,其在语音语调以及流畅性方面都远优于机器合成,使得用户能够获得较好的“听书”体验。Through the e-book voice playing solution provided by the embodiment, when the user is tired or the light is bad, the user can perform the voice playing of the corresponding e-book content through the voice playing instruction, thereby realizing the "listening" of the e-book reading application. Features. Moreover, in the embodiment of the present invention, real vocal audio is used, and compared with the machine-synthesized audio, the real vocal audio is far superior to the machine synthesis in terms of voice intonation and fluency because of recording through real vocals, so that the user Can get a better "listening" experience.
本实施例的电子书语音播放方法可以由任意适当的具有数据处理能力的设备执行,包括但不限于:各种终端设备(包括PC机、平板电脑、移动终端等)和服务器等。The e-book voice playing method of this embodiment may be executed by any suitable device having data processing capability, including but not limited to: various terminal devices (including PCs, tablets, mobile terminals, etc.) and servers.
实施例二Embodiment 2
参照图2,示出了根据本发明实施例二的一种电子书语音播放方法的步骤流程图。Referring to FIG. 2, a flow chart of steps of an e-book voice playing method according to a second embodiment of the present invention is shown.
本实施例的电子书语音播放方法包括以下步骤:The e-book voice playing method of this embodiment includes the following steps:
步骤S202:根据指示对电子书进行语音播放的语音播放指令和对电子书的显示内容的选择操作,确定待语音播放的电子书内容。Step S202: Determine the e-book content to be played by the voice according to the voice play instruction for performing voice play on the electronic book and the selection operation of the display content of the electronic book.
用户在阅读电子书时,在某些情况下会有“听书”的需求,如眼睛疲劳或者光线不好或者其它情况等,在此情况下,电子书应用所在的设备在接收到相应的用户操作后,生成相应的语音播放指令,以指示对相应的电子书内容进行语音播放。其中,如实施例一中所述,待语音播放的电子书内容可以是电子书阅读应用默认设置的内容,也可以是用户选择的内容。本实施例中,以用户选择为例,对本发明实施例提供的电子书语音播放方案进行说明。When users read e-books, in some cases there will be a need to "listen to the book", such as eye strain or poor lighting or other conditions. In this case, the device where the e-book application is located receives the corresponding user. After the operation, a corresponding voice play instruction is generated to indicate that the corresponding e-book content is played by voice. As described in the first embodiment, the content of the e-book to be played by the voice may be the content set by the e-book reading application by default, or may be the content selected by the user. In this embodiment, an e-book voice play solution provided by an embodiment of the present invention is described by taking a user selection as an example.
由用户选择待语音播放的电子书内容时,用户可以选择电子书内容的某一段或某几段、某一行或某几行、某一句或某几句的内容等,通过该种方式,可以提高用户“听书”内容的灵活性,提升用户“听书”体验。但本领域技术人员应当明了,如实施例一中所述的电子书阅读应用默认设置 的待语音播放的电子书内容也可同样适用本实施例的方案。When the user selects the content of the e-book to be played by the voice, the user can select a certain segment or a certain segment of the content of the e-book, a certain line or a certain number of lines, the content of a certain sentence or a certain sentence, etc., by which the method can improve The flexibility of the user's "listening to the book" content enhances the user's "listening to the book" experience. However, it should be understood by those skilled in the art that the e-book content to be voice-played by default in the e-book reading application described in the first embodiment can also be applied to the solution of the embodiment.
需要说明的是,在实际应用中,用户指示语音播放的操作和用户选择电子书内容的操作可以采用任意适当的顺序。如,可以先通过适当方式指示进行语音播放,然后再选择电子书内容;也可以先选择电子书内容,再指示对选择的电子书内容进行语音播放。本实施例中,仅以后者为例对本发明实施例的方案进行说明,但本领域技术人员可以参照本实施例实现基于前一方式的电子书语音播放方案。It should be noted that, in practical applications, the operation of the user to indicate the voice play and the operation of the user to select the e-book content may be in any suitable order. For example, the voice playback may be first indicated by an appropriate method, and then the e-book content may be selected; or the e-book content may be selected first, and then the selected e-book content may be voice-played. In this embodiment, only the latter embodiment is taken as an example to describe the solution of the embodiment of the present invention. However, those skilled in the art can implement the e-book voice playing solution based on the previous mode by referring to the embodiment.
在采用先选择电子书内容,再指示对选择的电子书内容进行语音播放的方式中,可以先接收对电子书的显示内容的选择操作,根据选择操作确定待语音播放的电子书内容。In the manner of first selecting the e-book content and then instructing the voice content to be played on the selected e-book content, the selection operation of the display content of the e-book may be first received, and the e-book content to be played by the voice is determined according to the selection operation.
在一种可选方式中,可以接收用户对电子书的显示内容的第一操作,确定第一操作在所述显示内容中的第一作用点;接收用户对所述显示内容的第二操作,确定第二操作在所述显示内容中的第二作用点;将第一作用点和第二作用点之间的显示内容确定为待语音播放的电子书内容。其中,第一操作和第二操作包括但不限于点选操作。In an optional manner, a first operation of the display content of the electronic book by the user may be received, a first action point of the first operation in the display content is determined, and a second operation of the display content by the user is received, Determining a second action point of the second operation in the display content; determining display content between the first action point and the second action point as the e-book content to be played by the voice. The first operation and the second operation include, but are not limited to, a click operation.
在另一种可选方式中,可以接收用户对电子书的显示内容的第三操作,确定第三操作在所述显示内容中的第三作用点;以第三作用点为参考点,将包括第三作用点在内的第一设定范围内的显示内容确定为待语音播放的电子书内容;或者,将以第三作用点为起点的第二设定范围内的显示内容确定为待语音播放的电子书内容;或者,将以第三作用点为终点的第三设定范围内的显示内容确定为待语音播放的电子书内容。其中,第一设定范围、第二设定范围和第三设定范围可以相同也可以不同,可以由本领域技术人员根据实际需求设置。并且,在以第三作用点为参考点,将包括第三作用点在内的第一设定范围内的显示内容确定为待语音播放的电子书内容中,可以以第三作用点为终点,将第一设定范围内的显示内容确定为待语音播放的电子书内容,但不限于此,第三作用点也可以不为终点。第三操作包括但不限于点选操作。通过该种方式,简化了用户操作,减轻了系统操作负担。In another optional manner, the user may receive a third operation of the display content of the electronic book, determine a third action point of the third operation in the display content, and use the third action point as a reference point, which will include The display content in the first setting range including the third action point is determined as the electronic book content to be played by the voice; or the display content in the second setting range starting from the third action point is determined as the to-be-voiced The content of the e-book to be played; or, the content of the third setting range ending with the third point of action is determined as the content of the e-book to be played by the voice. The first setting range, the second setting range, and the third setting range may be the same or different, and may be set by a person skilled in the art according to actual needs. And, taking the third action point as a reference point, determining the display content in the first setting range including the third action point as the content of the electronic book to be played by the voice, and ending the third action point, The display content in the first setting range is determined as the content of the electronic book to be played by the voice, but is not limited thereto, and the third action point may not be the end point. The third operation includes, but is not limited to, a click operation. In this way, user operations are simplified and the operating burden of the system is reduced.
在再一种可选方式中,可以接收用户对电子书的显示内容的选择操 作,确定所述选择操作所选择的显示内容对应的内容标记;将所述内容标记所标记的内容确定为待语音播放的电子书内容。此种方式中,电子书内容中预先设置有相应的内容标记,该内容标记可以由本领域技术人员根据实际需求设置,如每一章或每一节设置一个内容标记,或者,每一页设置一个内容标记,或者,每一段设置一个内容标记,或者,根据对电子书内容的分析,每一个完整情节(如老师和学生在课堂上的对话情节)或每一个完整场景(如某个海上场景)设置一个内容标记,等等。在此情况下,当用户进行了选择操作,如,通过第一操作和第二操作的方式选择了某部分电子书内容;或者,在当前显示的电子书内容的任意位置进行了点击操作,如第三操作的方式;或者,当内容标记在电子书中以适当提示方式展示给用户,在用户对相应的提示进行操作后,电子书阅读应用会先确定对应的内容标记,进而,将该内容标记所标记的整部分电子书内容确定为待语音播放的电子书内容。In another optional manner, the user may receive a selection operation of the display content of the electronic book, determine a content tag corresponding to the display content selected by the selection operation, and determine the content marked by the content tag as the to-be-voiced voice. The content of the e-book played. In this manner, a corresponding content mark is preset in the e-book content, and the content mark can be set by a person skilled in the art according to actual needs, such as setting a content mark for each chapter or each section, or setting one for each page. Content tagging, or, each segment is set to a content tag, or, based on an analysis of the e-book content, each complete episode (such as the teacher and student's dialogue in the classroom) or each complete scene (such as a sea scene) Set a content tag, and more. In this case, when the user performs a selection operation, for example, a certain portion of the e-book content is selected by the first operation and the second operation; or, a click operation is performed at any position of the currently displayed e-book content, such as The third operation mode; or, when the content tag is displayed to the user in an appropriate prompt manner in the e-book, after the user operates the corresponding prompt, the e-book reading application first determines the corresponding content tag, and further, the content is The entire portion of the e-book content marked by the tag is determined as the e-book content to be played by the voice.
但不限于上述方式,在实际应用中,其它适当的确定待语音播放的电子书内容的方式也同样适用于本发明实施例的方案,如将电子书当前显示的整个页面的内容确定为待语音播放的电子书内容等。However, the method is not limited to the above manner. In an actual application, other suitable manners for determining the content of the e-book to be played by the voice are also applicable to the solution of the embodiment of the present invention, such as determining the content of the entire page currently displayed by the e-book as the to-be-voiced voice. The content of the e-book to be played, etc.
步骤S204:获得与待语音播放的电子书内容相对应的真实人声音频,并播放所述真实人声音频。Step S204: Obtain real vocal audio corresponding to the content of the e-book to be played by the voice, and play the real vocal audio.
其中,真实人声音频包括以下至少之一:从与电子书对应的影视剧中获取的影视台词音频;与电子书的电子书内容对应的朗读音频;电子书所在的电子书阅读应用的用户录制的用户音频。The real vocal audio includes at least one of the following: a film and television audio obtained from a movie drama corresponding to the electronic book; a spoken audio corresponding to the electronic book content of the electronic book; and a user recording of the electronic book reading application where the electronic book is located User audio.
例如,电子书“三生三世十里桃花”里的一句话“虽于我只是短短两个月,于你却是极漫长的一生,司命给你写的命格你有否看过?”,若用户选择了电子书中的这句话,或者语音播放至该处,则可以播放电视剧“三生三世十里桃花”中演员说的这句话,但不限于此,图书改编为影视作品后,可能原文与影视台词不能完全一致,也即,不能精确匹配,在此情况下,匹配度满足一定阈值或标准即可,该阈值或标准可以由本领域技术人员适当设置,本发明实施例对此不作限制。For example, in the e-book "Sansheng Sanshi Shili Peach Blossom", "I am only a short two months, but it is a very long life for you. Have you read the life that you wrote to you?" If the user selects the sentence in the e-book, or the voice is played there, the sentence of the actor in the TV series "Sansheng Sanshi Shili Peach Blossom" can be played, but it is not limited to this, after the book is adapted into a film and television work. The original text may not be exactly the same as the video file, that is, the exact match may not be performed. In this case, the matching degree satisfies a certain threshold or standard, and the threshold or standard may be appropriately set by a person skilled in the art. No restrictions.
又例如,电子书“三国演义”对应有真人原声原文朗读音频,则在此 情况下,可以确定与待语音播放的电子书内容对应的音频的起始位置,从该起始位置进行播放。For another example, the electronic book "Three Kingdoms" corresponds to the original sound of the original voice, and in this case, the start position of the audio corresponding to the content of the electronic book to be played by the voice can be determined, and the play is performed from the home position.
再例如,电子书阅读应用的用户自己朗读了电子书的全部或部分内容并录制成音频,或者,结合电子书内容进行语音评论并保存为音频,在该音频可被使用的情况下,如该音频被用户设置为共享、或发送给他人、或在电子书阅读应用中通过适当方式进行了发布,如,通过电子书评论发布或通过分享方式或通过其它适当方式发布等,则当用户自己语音播放该电子书内容,或者可获得该音频的他人对该电子书内容进行语音播放时,可使用该音频实现“听书”。此种方式中,在步骤S202根据电子书的语音播放指令,确定待语音播放的电子书内容之前,还可以接收用户通过电子书阅读应用为电子书的内容录制的朗读音频,将录制的音频和对应的电子书的内容关联存储;和/或,接收用户通过电子书阅读应用为电子书的内容录制的评论音频,将评论音频和对应的电子书的内容关联存储。基于录制和关联存储的用户录制的音频,实现“听书”功能,进一步提升用户使用电子书阅读应用的体验。For another example, the user of the e-book reading application reads all or part of the content of the e-book and records it into audio, or combines the e-book content for voice commenting and saving it as audio, in the case where the audio can be used, such as The audio is set by the user to be shared, or sent to others, or published in an appropriate way in an e-book reading application, such as by e-book comment posting or by sharing or by other appropriate means, etc. When the content of the e-book is played, or the other person who can obtain the audio plays the content of the e-book, the audio can be used to implement the "listening". In this manner, before determining the e-book content to be played by the voice according to the voice play instruction of the e-book, the user can also receive the spoken audio recorded by the user through the e-book reading application for the content of the e-book, and the recorded audio and The content of the corresponding e-book is stored in association; and/or, the user receives the comment audio recorded by the e-book reading application for the content of the e-book, and associates the comment audio with the content of the corresponding e-book. The "listening" function is realized based on the recorded audio of the user recorded and associated storage, further enhancing the user's experience of using the e-book reading application.
需要说明的是,在一种可选方案中,还可以对上述真实人声音频进行进一步的处理,如拆分后重新合成,以满足某些情形下的真实人声音频播放需要,如,影视台词的拆分和重新组合、朗读音频的拆分和重新组合、用户音频的拆分和重新组合等等,从而形成新的真实人声音频。It should be noted that, in an optional solution, the above-mentioned real vocal audio can be further processed, such as splitting and re-synthesizing to meet the real vocal audio playing needs in certain situations, such as video The splitting and recombination of lines, the splitting and recombination of audio readings, the splitting and recombination of user audio, etc., form new real vocal audio.
此外,真实人声音频还可以与背景音频和/或业务音频进行合成,生成合成音频,在此情况下,将获得与待语音播放的电子书内容相对应的合成音频,其中,合成音频除包括所述真实人声音频外,还包括背景音频和/或业务音频;进而播放该合成音频。其中,背景音频可以为背景音乐,通过背景音频可以进一步烘托气氛,使用户更能感受该部分电子书内容的气氛;业务音频可以为由当前真实人声音频中的人录制的业务音频,或者,是与待语音播放的电子书内容相关的业务音频,如情节相关的业务音频。业务音频可以插入在当前真实人声音频的开头、结尾、或者开头至结尾中任意适当的位置,可选地,业务音频可以实现为广告音频。In addition, the real vocal audio can also be synthesized with the background audio and/or the business audio to generate synthesized audio, in which case the synthesized audio corresponding to the electronic book content to be played by the voice will be obtained, wherein the synthesized audio includes In addition to the real vocal audio, background audio and/or service audio is also included; and the synthesized audio is played. The background audio can be background music, and the background audio can further enhance the atmosphere, so that the user can feel the atmosphere of the part of the e-book content; the service audio can be the business audio recorded by the person in the current real vocal audio, or It is a business audio related to the content of the e-book to be played by voice, such as a story-related business audio. The business audio can be inserted at any appropriate position at the beginning, end, or beginning to end of the current real vocal audio. Alternatively, the business audio can be implemented as an advertising audio.
在一种获得与待语音播放的电子书内容相对应的真实人声音频的实 现方式中,可以为电子书预先设置内容标记,也为真实人声音频预先设置音频标记。也即,电子书中预设有用于标记电子书内容的至少一个内容标记,真实人声音频中预设有用于标记音频内容的至少一个音频标记,基于此,可以根据所述内容标记与所述音频标记之间的对应关系,获得与电子书内容相对应的真实人声音频。In an implementation manner of obtaining real vocal audio corresponding to the content of the e-book to be played by the voice, the content tag may be pre-set for the e-book, and the audio tag may be pre-set for the real vocal audio. That is, at least one content tag for marking the content of the e-book is pre-set in the e-book, and at least one audio tag for marking the audio content is pre-set in the real vocal audio, based on which the content tag can be Correspondence between audio tags, obtaining real vocal audio corresponding to the contents of the e-book.
具体地,可以确定与待语音播放的电子书内容对应的内容标记;根据预存的内容标记与音频标记的对应关系,确定与所述内容标记对应的音频标记;获取与确定的所述音频标记相对应的音频内容。通过内容标记和音频标记的方式,可以快速、准确地获得与电子书内容相对应的真实人声音频,提高“听书”功能对用户操作的响应速度。Specifically, a content tag corresponding to the e-book content to be played by the voice may be determined; and an audio tag corresponding to the content tag is determined according to the correspondence between the pre-stored content tag and the audio tag; and the determined audio tag is obtained Corresponding audio content. Through the content mark and audio mark, the real vocal audio corresponding to the e-book content can be obtained quickly and accurately, and the response speed of the "listening" function to the user operation is improved.
在另一种获得与待语音播放的电子书内容相对应的真实人声音频的实现方式中,可以预先(如在根据指示对电子书进行语音播放的语音播放指令,确定待语音播放的电子书内容的步骤之前)对已存在或已获取的真实人声音频进行语音识别,获得对应的文字内容;确定电子书中与文字内容相匹配的电子书内容;建立并存储所述文字内容对应的真实人声音频与确定的电子书内容之间的对应关系。例如,对一段时长为30分钟的影视台词音频进行语音识别,获得对应的多段文字内容;进而,将该多段文字内容分别与电子书内容进行匹配,根据匹配结果确定该多段文字内容与电子书中的多段内容之间的对应关系;进而,可以根据二者之间的关系,建立并存储语音识别出的多段文字内容对应的真实人声音频中的多个部分与多段电子书内容之间的对应关系。基于此,在获得与待语音播放的电子书内容相对应的真实人声音频时,可以根据该对应关系,获得与待语音播放的电子书内容相对应的真实人声音频。In another implementation manner of obtaining real vocal audio corresponding to the content of the e-book to be played by the voice, the e-book to be played by the voice may be determined in advance (for example, in a voice play instruction for performing voice play on the e-book according to the indication) Before the step of content) performing voice recognition on the existing or acquired real vocal audio, obtaining the corresponding text content; determining the e-book content in the e-book that matches the text content; establishing and storing the real content corresponding to the text content The correspondence between the vocal audio and the determined e-book content. For example, voice recognition is performed on a piece of video audio of a period of 30 minutes, and corresponding multi-segment text content is obtained; further, the multi-segment text content is respectively matched with the e-book content, and the multi-segment text content and the e-book are determined according to the matching result. Corresponding relationship between multiple pieces of content; further, according to the relationship between the two, the correspondence between the plurality of parts of the real vocal audio corresponding to the plurality of pieces of text content recognized by the speech and the contents of the plurality of pieces of e-book contents can be established and stored relationship. Based on this, when the real vocal audio corresponding to the e-book content to be played by the voice is obtained, the real vocal audio corresponding to the e-book content to be played by the voice can be obtained according to the correspondence.
此外,在一种可选方式中,如果真实人声音频包括多种,如,包括影视台词音频、电子书内容朗读音频和用户音频中的至少两个时,在获得与电子书内容相对应的真实人声音频时,可以按照预设的优先级,从影视台词音频、电子书内容朗读音频和用户音频中的至少两个中,获得与电子书内容相对应的真实人声音频;或者,也可以接收用户对影视台词音频、电子书内容朗读音频和用户音频中的至少两个对应的选项的选择操作,获得 所述选择操作所选择的与电子书内容相对应的真实人声音频;或者,也可以根据用户播放真实人声音频的历史数据,确定用户的音频类型偏好;根据用户的音频类型偏好,从影视台词音频、电子书内容朗读音频和用户音频中的至少两个中,获得与待语音播放的电子书内容相对应的真实人声音频。如,用户的历史数据表明该用户有过十次的语音播放记录,其中,八次使用了影视台词音频,则在用户再次进行语音播放时,可以直接使用影视台词音频进行相应的电子书内容的语音播放。In addition, in an optional manner, if the real vocal audio includes a plurality of, for example, at least two of the audio-visual line audio, the e-book content reading audio, and the user audio, obtaining the content corresponding to the e-book content In real vocal audio, real vocal audio corresponding to the e-book content can be obtained from at least two of the audio and video audio, the e-book content reading audio, and the user audio according to a preset priority; or Receiving, by the user, a selection operation of at least two corresponding options of the audio-visual line audio, the e-book content reading audio, and the user audio, and obtaining the real vocal audio corresponding to the e-book content selected by the selecting operation; or The user may also determine the audio type preference of the user according to the historical data of the user playing the real vocal audio; according to the user's audio type preference, at least two of the audio and video audio, the e-book content reading audio, and the user audio are obtained and to be obtained. The real vocal audio corresponding to the e-book content of the voice playback. For example, the user's historical data indicates that the user has had ten voice playback records. Among them, the audio and video audio is used eight times. When the user plays the voice again, the audio and video audio can be directly used to perform the corresponding e-book content. Voice playback.
又例如,在第一种方式中,假设设置影视台词音频、电子书内容朗读音频和用户音频这三种音频的优先级从高到低依次为:用户音频、影视台词音频、电子书内容朗读音频。则当电子书的某部分文字同时对应有这三种音频时,则播放用户音频;而如果电子书的某部分文字仅对应有其中的部分音频时,如对应有影视台词音频和电子书内容朗读音频,则根据该优先级将播放影视台词音频,而若该部分文字仅对应有电子书内容朗读音频,则将播放该电子书内容朗读音频。需要说明的是,上述优先级设置仅为示例性说明,本领域技术人员可以根据实际需要适当设置,本发明实施例对此不作限制。通过设置优先级,既最大可能地保证了电子书文字对应有音频,又使得音频的形式多样化。For another example, in the first mode, it is assumed that the priority of setting three types of audio such as audio and video audio, e-book content reading audio, and user audio is from high to low: user audio, audio and video audio, and e-book content reading audio. . When a certain part of the text of the e-book corresponds to the three kinds of audio at the same time, the user audio is played; and if a part of the text of the e-book only corresponds to some of the audio, for example, the audio and electronic contents of the electronic book and the e-book content are read aloud. Audio, according to the priority level will play the film and television audio, and if the part of the text only corresponds to the e-book content reading audio, the e-book content will be played aloud audio. It should be noted that the foregoing priority setting is only an exemplary description, and may be appropriately set by a person skilled in the art according to actual needs, which is not limited by the embodiment of the present invention. By setting the priority, it is possible to ensure, as much as possible, that the e-book text corresponds to audio, and the form of the audio is diversified.
而通过第二种方式,为用户选择与电子书内容对应的真实人声音频提供了更大的灵活性,用户可以自行选择音频进而进行播放。其中,影视台词音频、电子书内容朗读音频和用户音频对应的选项可以由本领域技术人员根据实际需求适当设置,在一种可选的实现方式中,可以通过弹窗或者透明覆盖层显示影视台词音频、电子书内容朗读音频和用户音频对应的选项。例如,当接收到对某部分电子书内容进行语音播放的语音播放指令后,电子书应用通过弹窗或者透明覆盖层向用户展示相应的音频选项供用户选择,在得到用户的选择结果后,播放与该选择结果对应的真实人声音频,如,用户选择了影视台词音频,则播放与该部分电子书内容对应的影视台词音频。基于显示电子书内容的界面,通过弹窗或透明覆盖层显示音频选项,方便了用户操作,提升了用户使用体验。In the second way, the user is provided with greater flexibility in selecting the real vocal audio corresponding to the e-book content, and the user can select the audio and play it. The options corresponding to the audio and video audio, the e-book content reading audio, and the user audio may be appropriately set by a person skilled in the art according to actual needs. In an optional implementation manner, the audio and video audio may be displayed through a pop-up window or a transparent overlay. The e-book content reads the audio and user audio options. For example, after receiving a voice play instruction for performing voice playback on a part of the e-book content, the e-book application presents a corresponding audio option to the user through a pop-up window or a transparent overlay layer for the user to select, and after playing the user's selection result, playing The real vocal audio corresponding to the selection result, for example, if the user selects the film and television word audio, the audio and video audio corresponding to the part of the electronic book content is played. Based on the interface for displaying the content of the e-book, the audio option is displayed through the pop-up window or the transparent overlay layer, which facilitates the user's operation and improves the user experience.
通过上述过程,实现了电子书内容的“听书”功能,在此基础上,可 选地,还可以进一步进行下述步骤S206或步骤S208的操作。Through the above process, the "listening" function of the e-book content is realized. On the basis of this, optionally, the following operations of step S206 or step S208 can be further performed.
步骤S206:在播放真实人声音频的过程中,接收到对电子书的翻页操作,暂停所述真实人声音频的播放;根据翻页操作重新确定待语音播放的电子书内容;获得与重新确定的电子书内容相对应的真实人声音频并播放。Step S206: in the process of playing the real vocal audio, receiving the page turning operation of the e-book, suspending the playing of the real vocal audio; re-determining the e-book content to be played by the voice according to the page turning operation; obtaining and re-creating The actual vocal audio corresponding to the determined e-book content is played and played.
在某一真实人声音频播放过程中,有可能该音频还未播放完,用户即进行了相应的操作,如上翻页或下翻页操作,电子书阅读应用在监测到音频播放过程中的翻页操作后,会自动暂停该音频的播放;进一步地,根据该翻页操作重新确定待语音播放的电子书内容,如,确定该翻页操作最终的目标页面,进而根据该目标页面的内容重新确定待语音播放的电子书内容。In the process of playing a real vocal audio, it is possible that the audio has not been played yet, and the user has performed corresponding operations, such as page turning or page turning, and the e-book reading application is monitored during the audio playback process. After the page operation, the playing of the audio is automatically suspended; further, the e-book content to be played by the voice is re-determined according to the page turning operation, for example, determining the final target page of the page turning operation, and then re-creating according to the content of the target page. Determine the content of the e-book to be played.
一种可能的情况下,假设当前真实人声音频正在播放电子书第5页第三段的第一句话的内容,此时,用户进行了连续的下翻页操作,最后停在了电子书页面的第10页,此种情况下,可以停止之前的音频,转而播放第10页的电子书内容的真实人声音频(如第10页的首个电子书的内容标记对应的音频,或者,第10页的起始文字对应的音频,或者,第10页的情节或者场景对应的音频等等);也可以停止之前的音频,接收用户对第10页的电子书内容的选择操作后,播放该次选择操作所选择的电子书内容对应的真实人声音频。上翻页操作与下翻页操作类似,在此不再赘述。In a possible case, suppose that the current real vocal audio is playing the content of the first sentence of the third paragraph of the fifth page of the e-book. At this time, the user performs a continuous page turning operation, and finally stops at the e-book. Page 10 of the page, in this case, you can stop the previous audio and play the real vocal audio of the e-book content on page 10 (such as the audio corresponding to the content tag of the first e-book on page 10, or , the audio corresponding to the start text on page 10, or the audio of the scene on page 10 or the scene, etc.); it is also possible to stop the previous audio and receive the user's selection of the e-book content on page 10, The real vocal audio corresponding to the e-book content selected by the selection operation is played. The page turning operation is similar to the page turning operation, and will not be described here.
另一种可能的情况下,假设当前真实人声音频正在播放电子书第5页第三段的第一句话的内容,此时,用户进行了连续的下翻页操作,翻至电子书页面的第10页后又进行了上翻页操作,翻回至电子书页面的第5页,此种情况下,则可以继续之前中断的真实人声音频的播放。但不限于此,也可以停止之前的音频,重新确定第5页的电子书内容对应的真实人声音频,如,第5页的首个电子书的内容标记对应的音频,或者,第5页的起始文字对应的音频,或者,第5页的情节或者场景对应的音频等等。但继续之前中断的真实人声音频的播放的方式相较于其它方式,更接近于用户“听书”的真实需求,提升用户“听书”体验。In another possible case, suppose that the current real vocal audio is playing the content of the first sentence of the third paragraph of the fifth page of the e-book. At this time, the user performs a continuous page turning operation and turns to the e-book page. After the 10th page, the page flip operation is performed again, and the page 5 is turned back to the e-book page. In this case, the playback of the real vocal audio that was previously interrupted can be continued. However, it is not limited thereto, and the previous audio may be stopped, and the real vocal audio corresponding to the e-book content of the fifth page may be re-determined, for example, the audio corresponding to the content mark of the first e-book of the fifth page, or, page 5 The initial text corresponds to the audio, or the episode on page 5 or the audio corresponding to the scene, and so on. However, the way of continuing the playback of the real vocal audio before the interruption is closer to the real needs of the user "listening to the book" than the other methods, and improving the user's "listening to the book" experience.
当然,实际应用中,若在播放真实人声音频的过程中,接收到对电子 书的翻页操作,也可以停止真实人声音频的播放,等待用户的下一个语音播放指令。Of course, in the actual application, if the page turning operation of the electronic book is received during the process of playing the real vocal audio, the playing of the real vocal audio can be stopped, and the user's next voice playing instruction is awaited.
步骤S208:在播放真实人声音频的过程中,接收对播放的真实人声音频的音频处理指令,对真实人声音频进行音频处理指令所指示的操作。Step S208: In the process of playing the real vocal audio, receiving an audio processing instruction for the played real vocal audio, and performing an operation indicated by the audio processing instruction on the real vocal audio.
其中,音频处理指令包括但不限于以下至少之一:用于指示暂停真实人声音频播放的暂停指令、用于指示调整真实人声音频的播放速度的第一调整指令、用于指示调整真实人声音频的播放进度的第二调整指令、用于指示退出真实人声音频播放的退出指令、用于指示切换真实人声音频的类型的切换指令。The audio processing instruction includes, but is not limited to, at least one of: a pause instruction for instructing suspension of real human voice audio playback, a first adjustment instruction for indicating a playback speed of adjusting real human voice audio, and an instruction for adjusting the real person. A second adjustment instruction of the playback progress of the audio and audio, an exit instruction for instructing the exit of the real human voice audio, and a switching instruction for indicating the type of switching the real human voice audio.
例如,用户在通过真实人声音频“听书”过程中,需要离开终端设备时,可以通过操作“暂停”或类似操作选项向电子书阅读应用发送暂停指令,暂停当前音频的播放;或者,当检测到用户中断了电子书阅读应用转而使用其它应用时,电子书阅读应用可以自动生成相应的暂停指令,指示暂停当前音频的播放。For example, when the user needs to leave the terminal device through the real vocal audio “listening to the book”, the user may send a pause instruction to the e-book reading application by operating the “pause” or the similar operation option to pause the playing of the current audio; or, when When it is detected that the user interrupts the e-book reading application and uses other applications, the e-book reading application can automatically generate a corresponding pause instruction to suspend the playback of the current audio.
又例如,用户需要终止音频播放时,可以通过操作“停止”或类似操作选项向电子书阅读应用发送于指示退出真实人声音频播放的退出指令,以停止当前真实人声音频的播放。For another example, when the user needs to terminate the audio playback, the user may send an exit instruction indicating that the real human voice audio is exited to the e-book reading application by operating a “stop” or the like operation option to stop the playing of the current real human voice audio.
再例如,如前所述,当真实人声音频包括影视台词音频、电子书内容朗读音频和用户音频中的至少两个时,用户可以通过对显示的其它音频类型的选择操作,或者,通过“切换人声”或类似操作选项向电子书阅读应用发送指示切换真实人声音频的类型的切换指令。如,当前真实人声音频为用户音频,用户通过对“切换人声”操作选项的操作,从显示的多种音频类型中选择一个类型进行切换,例如,将用户音频切换为影视台词音频或者电子书内容朗读音频。For another example, as described above, when the real vocal audio includes at least two of the audio-visual audio, the e-book content reading audio, and the user audio, the user may perform a selection operation on other audio types displayed, or by " A switch vocal" or similar operation option sends a switch instruction to the e-book reading application indicating the type of switching real vocal audio. For example, the current real vocal audio is user audio, and the user selects one of a plurality of displayed audio types by the operation of the “switch vocal” operation option, for example, switching the user audio to the audio and video audio or electronic The contents of the book read the audio.
又例如,用户希望调整音频的播放速度,则可以通过相应的播放速度调整操作选项,向电子书阅读应用发送指示调整真实人声音频的播放速度的第一调整指令,以调整当前音频的播放速度。如,用户选择了“2倍速”播放,则当前真实人声音频的播放速度将调整为原播放速度的2倍。For another example, if the user wants to adjust the playing speed of the audio, the first adjusting instruction for adjusting the playing speed of the real vocal audio can be sent to the e-book reading application through the corresponding playing speed adjusting operation option to adjust the playing speed of the current audio. . For example, if the user selects “2x speed” playback, the playback speed of the current real vocal audio will be adjusted to 2 times of the original playback speed.
再例如,用户希望快进或快退音频,则可以通过相应的播放进度调整 操作选项,向电子书阅读应用发送指示调整真实人声音频的播放进度的第二调整指令。如,用户可以通过点选“快进”或类似操作选项,或者,通过拖动音频播放进度条,进行当前真实人声音频的播放进度的调整。For another example, if the user wishes to fast forward or rewind the audio, the second adjustment instruction indicating that the playback progress of the real vocal audio is adjusted may be sent to the e-book reading application through the corresponding play progress adjustment operation option. For example, the user can adjust the playing progress of the current real vocal audio by clicking the “fast forward” or similar operation option, or by dragging the audio playback progress bar.
需要说明的是,上述音频处理指令可以由本领域技术人员通过任意适当的设置实现,在一种可选方式中,可以通过悬浮图标或悬浮窗口或透明覆盖层,显示上述音频处理指令。通过这种显示音频处理指令的方式,一方面,尽可能地减小了显示的音频处理指令对用户阅读电子书造成的影响;另一方向,也使得用户对音频的控制和处理更为便利,提升了用户“听书”体验。It should be noted that the foregoing audio processing instructions may be implemented by any suitable setting by those skilled in the art. In an optional manner, the audio processing instructions may be displayed by a floating icon or a floating window or a transparent overlay. By means of displaying the audio processing instructions, on the one hand, the influence of the displayed audio processing instructions on the user's reading of the e-book is reduced as much as possible; and the other direction makes the user's control and processing of the audio more convenient. Improved user "listening" experience.
通过本实施例提供的电子书语音播放方案,在用户在眼睛疲劳或者光线不好的情况下,可以通过语音播放指令进行相应电子书内容的语音播放,实现了电子书阅读应用的“听书”功能。并且,本发明实施例中,使用真实人声音频,相比较于机器合成的音频,真实人声音频因为通过真实人声录制,其在语音语调以及流畅性方面都远优于机器合成,使得用户能够获得较好的“听书”体验。Through the e-book voice playing solution provided by the embodiment, when the user is tired or the light is bad, the user can perform the voice playing of the corresponding e-book content through the voice playing instruction, thereby realizing the "listening" of the e-book reading application. Features. Moreover, in the embodiment of the present invention, real vocal audio is used, and compared with the machine-synthesized audio, the real vocal audio is far superior to the machine synthesis in terms of voice intonation and fluency because of recording through real vocals, so that the user Can get a better "listening" experience.
本实施例的电子书语音播放方法可以由任意适当的具有数据处理能力的设备执行,包括但不限于:各种终端设备(包括PC机、平板电脑、移动终端等)和服务器等。The e-book voice playing method of this embodiment may be executed by any suitable device having data processing capability, including but not limited to: various terminal devices (including PCs, tablets, mobile terminals, etc.) and servers.
实施例三Embodiment 3
参照图3,示出了根据本发明实施例三的一种电子书语音播放装置的结构框图。Referring to FIG. 3, a block diagram of a structure of an electronic book voice playback apparatus according to a third embodiment of the present invention is shown.
本实施例的电子书语音播放装置包括:内容确定模块302,用于根据用于指示电子书进行语音播放的语音播放指令,确定待语音播放的电子书内容;音频播放模块304,用于获得与所述电子书内容相对应的真实人声音频,并播放所述真实人声音频。The e-book voice playing device of the embodiment includes: a content determining module 302, configured to determine an e-book content to be played by the voice according to a voice playing instruction for instructing the e-book to perform voice playing; and an audio playing module 304, configured to obtain and The e-book content corresponds to real vocal audio and plays the real vocal audio.
通过本实施例提供的电子书语音播放装置,在用户在眼睛疲劳或者光线不好的情况下,可以通过语音播放指令进行相应电子书内容的语音播放,实现了电子书阅读应用的“听书”功能。并且,本发明实施例中,使用真实人声音频,相比较于机器合成的音频,真实人声音频因为通过真实 人声录制,其在语音语调以及流畅性方面都远优于机器合成,使得用户能够获得较好的“听书”体验。Through the e-book voice playing device provided by the embodiment, when the user is tired or the light is bad, the user can perform the voice playing of the corresponding e-book content through the voice playing instruction, thereby realizing the "listening" of the e-book reading application. Features. Moreover, in the embodiment of the present invention, real vocal audio is used, and compared with the machine-synthesized audio, the real vocal audio is far superior to the machine synthesis in terms of voice intonation and fluency because of recording through real vocals, so that the user Can get a better "listening" experience.
实施例四Embodiment 4
参照图4,示出了根据本发明实施例四的一种电子书语音播放装置的结构框图。Referring to FIG. 4, a block diagram of a structure of an electronic book voice playback apparatus according to a fourth embodiment of the present invention is shown.
本实施例的电子书语音播放装置包括:内容确定模块402,用于根据用于指示电子书进行语音播放的语音播放指令,确定待语音播放的电子书内容;音频播放模块404,用于获得与所述电子书内容相对应的真实人声音频,并播放所述真实人声音频。The e-book voice playing device of the embodiment includes: a content determining module 402, configured to determine an e-book content to be played by the voice according to a voice playing instruction for instructing the e-book to perform voice playing; and an audio playing module 404, configured to obtain The e-book content corresponds to real vocal audio and plays the real vocal audio.
可选地,真实人声音频包括以下至少之一:从与电子书对应的影视剧中获取的影视台词音频;与电子书的电子书内容对应的朗读音频;电子书所在的电子书阅读应用的用户录制的用户音频。Optionally, the real vocal audio includes at least one of: a film and television audio obtained from a movie drama corresponding to the electronic book; a reading audio corresponding to the electronic book content of the electronic book; and an e-book reading application where the electronic book is located User audio recorded by the user.
可选地,音频播放模块404用于获得与待播放的电子书内容相对应的合成音频,其中,合成音频除包括真实人声音频外,还包括背景音频和/或业务音频;以及用于播放所述合成音频。Optionally, the audio playing module 404 is configured to obtain synthesized audio corresponding to the electronic book content to be played, wherein the synthesized audio includes background audio and/or service audio in addition to the real human voice audio; and is used for playing The synthesized audio.
可选地,电子书中预设有用于标记电子书内容的至少一个内容标记,真实人声音频中预设有用于标记音频内容的至少一个音频标记;音频播放模块404,用于根据所述内容标记与所述音频标记之间的对应关系,获得与所述电子书内容相对应的真实人声音频,并播放所述真实人声音频。Optionally, at least one content tag for marking the content of the e-book is pre-set in the e-book, and at least one audio tag for marking the audio content is pre-set in the real vocal audio; the audio playing module 404 is configured to Marking a correspondence relationship with the audio mark, obtaining real vocal audio corresponding to the electronic book content, and playing the real vocal audio.
可选地,音频播放模块404用于确定与待语音播放的电子书内容对应的内容标记;根据预存的内容标记与音频标记的对应关系,确定与所述内容标记对应的音频标记;获取与确定的所述音频标记相对应的音频内容,并播放所述音频内容。Optionally, the audio play module 404 is configured to determine a content mark corresponding to the electronic book content to be played by the voice; determine an audio mark corresponding to the content mark according to the corresponding relationship between the pre-stored content mark and the audio mark; acquire and determine The audio tag corresponds to the audio content and plays the audio content.
可选地,本实施例的电子书语音播放装置还包括:建立关系模块406,用于在内容确定模块402根据用于指示电子书进行语音播放的语音播放指令,确定待语音播放的电子书内容之前,对真实人声音频进行语音识别,获得对应的文字内容;确定电子书中与所述文字内容相匹配的电子书内容;建立并存储所述文字内容对应的真实人声音频与确定的电子书内容之 间的对应关系;音频播放模块404用于根据所述文字内容对应的真实人声音频与确定的所述电子书内容之间的对应关系,获得与待语音播放的电子书内容相对应的真实人声音频,并播放所述真实人声音频。Optionally, the e-book voice playing device of the embodiment further includes: a relationship establishing module 406, configured to determine, according to the voice playing instruction for instructing the e-book to perform voice playing, the content of the e-book to be played by the voice Previously, speech recognition is performed on real vocal audio to obtain corresponding text content; e-book content matching the text content in the e-book is determined; real vocal audio corresponding to the text content and determined electronic are established and stored Corresponding relationship between the contents of the book; the audio playing module 404 is configured to obtain, according to the correspondence between the real vocal audio corresponding to the text content and the determined content of the electronic book, corresponding to the content of the electronic book to be played by the voice Real vocal audio and play the real vocal audio.
可选地,当真实人声音频包括影视台词音频、电子书内容朗读音频和用户音频中的至少两个时,音频播放模块404用于按照预设的优先级,从影视台词音频、电子书内容朗读音频和用户音频中的至少两个中,获得与电子书内容相对应的真实人声音频,并播放所述真实人声音频;或者,音频播放模块404用于接收用户对影视台词音频、电子书内容朗读音频、和用户音频中的至少两个对应的选项的选择操作,获得所述选择操作所选择的、与所述电子书内容相对应的真实人声音频,并播放所述真实人声音频;或者,音频播放模块404用于根据用户播放真实人声音频的历史数据,确定用户的音频类型偏好;根据用户的音频类型偏好,从影视台词音频、电子书内容朗读音频和用户音频中的至少两个中,获得与待语音播放的电子书内容相对应的真实人声音频,并播放所述真实人声音频。Optionally, when the real vocal audio includes at least two of the audio-visual audio, the e-book content reading audio, and the user audio, the audio playing module 404 is configured to use the audio and television content and the e-book content according to the preset priority. Reading at least two of the audio and the user audio, obtaining real vocal audio corresponding to the e-book content, and playing the real vocal audio; or, the audio playing module 404 is configured to receive the user's audio and video a book content reading audio, and a selection operation of at least two corresponding options in the user audio, obtaining real vocal audio selected by the selection operation corresponding to the e-book content, and playing the real vocal Audio; or, the audio playing module 404 is configured to determine the user's audio type preference according to the historical data of the user playing the real vocal audio; according to the user's audio type preference, from the audio and video audio, the e-book content, the audio and the user audio In at least two, real vocal audio corresponding to the content of the e-book to be played by the voice is obtained, and played Said real human voice audio.
可选地,本实施例的电子书语音播放装置还包括:显示模块408,用于在音频播放模块404接收用户对影视台词音频、电子书内容朗读音频和用户音频中的至少两个对应的选项的选择操作之前,通过弹窗或者透明覆盖层显示影视台词音频、电子书内容朗读音频、和用户音频中的至少两个对应的选项。Optionally, the electronic book voice playing device of the embodiment further includes: a display module 408, configured to receive, by the audio playing module 404, at least two corresponding options of the user for the audio and video audio, the electronic book content reading audio, and the user audio. Before the selection operation, at least two corresponding options of the audio and video audio, the e-book content reading audio, and the user audio are displayed through a pop-up window or a transparent overlay.
可选地,内容确定模块402用于根据指示对电子书进行语音播放的语音播放指令和对电子书的显示内容的选择操作,确定待语音播放的电子书内容。Optionally, the content determining module 402 is configured to determine, according to the voice play instruction for performing voice play on the electronic book and the selection operation of the display content of the electronic book, the electronic book content to be played by the voice.
可选地,本实施例的电子书语音播放装置还包括:内容选择模块410,用于在内容确定模块402根据用于指示电子书进行语音播放的语音播放指令和对电子书的显示内容的选择操作,确定待语音播放的电子书内容之前,接收对电子书的显示内容的选择操作,根据所述选择操作确定待语音播放的电子书内容。Optionally, the electronic book voice playing device of the embodiment further includes: a content selection module 410, configured to select, according to the voice playing instruction for instructing the electronic book to perform voice playing, and the selection content of the electronic book in the content determining module 402 The operation, before determining the content of the electronic book to be played by the voice, receives a selection operation of the display content of the electronic book, and determines the content of the electronic book to be played by the voice according to the selection operation.
可选地,内容选择模块410包括:第一选择模块4102,用于接收用户对电子书的显示内容的第一操作,确定第一操作在显示内容中的第一作 用点;接收用户对显示内容的第二操作,确定第二操作在所述显示内容中的第二作用点;将第一作用点和第二作用点之间的显示内容确定为待语音播放的电子书内容。Optionally, the content selection module 410 includes: a first selection module 4102, configured to receive a first operation of the display content of the electronic book by the user, determine a first action point of the first operation in the display content, and receive the display content of the user The second operation determines a second action point of the second operation in the display content; determining display content between the first action point and the second action point as the e-book content to be played by the voice.
可选地,内容选择模块410包括:第二选择模块4104,用于接收用户对电子书的显示内容的第三操作,确定第三操作在所述显示内容中的第三作用点;以第三作用点为参考点,将包括第三作用点在内的第一设定范围内的显示内容确定为待语音播放的电子书内容;或者,将以第三作用点为起点的第二设定范围内的显示内容确定为待语音播放的电子书内容;或者,将以第三作用点为终点的第三设定范围内的显示内容确定为待语音播放的电子书内容。Optionally, the content selection module 410 includes: a second selection module 4104, configured to receive a third operation of the display content of the electronic book by the user, determine a third action point of the third operation in the display content; The action point is a reference point, and the display content in the first setting range including the third action point is determined as the content of the e-book to be played by the voice; or the second setting range starting from the third action point The display content inside is determined as the e-book content to be played by the voice; or the display content in the third setting range ending with the third action point is determined as the e-book content to be played by the voice.
可选地,内容选择模块410包括:第三选择模块4106,用于接收用户对电子书的显示内容的选择操作,确定所述选择操作所选择的显示内容对应的内容标记;将所述内容标记所标记的内容确定为待语音播放的电子书内容。Optionally, the content selection module 410 includes: a third selection module 4106, configured to receive a user's selection operation on the display content of the electronic book, determine a content tag corresponding to the display content selected by the selection operation, and mark the content The marked content is determined as the e-book content to be played by the voice.
可选地,本实施例的电子书语音播放装置还包括:录制存储模块412,用于在内容确定模块402根据用于指示电子书进行语音播放的语音播放指令,确定待语音播放的电子书内容之前,接收用户通过电子书阅读应用为电子书的内容录制的朗读音频,将录制的音频和对应的电子书的内容关联存储;和/或,接收用户通过电子书阅读应用为电子书的内容录制的评论音频,将评论音频和对应的电子书的内容关联存储。Optionally, the e-book voice playing device of the embodiment further includes: a recording storage module 412, configured to determine, according to the voice playing instruction for instructing the e-book to perform voice playing, the content determining module 402 to determine the e-book content to be played by the voice Previously, receiving the spoken audio recorded by the user through the e-book reading application for the content of the e-book, associating the recorded audio with the content of the corresponding e-book; and/or receiving the user recording the content of the e-book through the e-book reading application The comment audio stores the comment audio associated with the content of the corresponding e-book.
可选地,本实施例的电子书语音播放装置还包括:音频处理模块414,用于接收对播放的真实人声音频的音频处理指令,对所述真实人声音频进行所述音频处理指令所指示的操作。Optionally, the e-book voice playback device of this embodiment further includes: an audio processing module 414, configured to receive an audio processing instruction for the played real vocal audio, and perform the audio processing instruction on the real vocal audio Indicated action.
可选地,所述音频处理指令包括以下至少之一:用于指示暂停所述真实人声音频播放的暂停指令、用于指示调整所述真实人声音频的播放速度的第一调整指令、用于指示调整所述真实人声音频的播放进度的第二调整指令、用于指示退出所述真实人声音频播放的退出指令、用于指示切换真实人声音频的类型的切换指令。Optionally, the audio processing instruction includes at least one of: a pause instruction for instructing suspension of the real human voice audio playback, a first adjustment instruction for indicating a playback speed of the real human voice audio, And a second adjustment instruction indicating that the playback progress of the real vocal audio is adjusted, an exit instruction for instructing to exit the real vocal audio play, and a switching instruction for indicating a type of switching the real vocal audio.
可选地,显示模块408还用于通过悬浮图标或悬浮窗口或透明覆盖 层,显示所述音频处理指令。Optionally, the display module 408 is further configured to display the audio processing instruction by using a floating icon or a floating window or a transparent overlay.
可选地,本实施例的电子书语音播放装置还包括:重确定模块416,用于在播放真实人声音频的过程中,接收到对电子书的翻页操作,暂停所述真实人声音频的播放;根据所述翻页操作重新确定待语音播放的电子书内容;获得与重新确定的所述电子书内容相对应的真实人声音频并播放。Optionally, the e-book voice playback device of the embodiment further includes: a re-determination module 416, configured to receive a page turning operation on the e-book during the process of playing the real vocal audio, and suspend the real vocal audio Playback; re-determine the e-book content to be played by the voice according to the page turning operation; obtain real vocal audio corresponding to the re-determined e-book content and play.
本实施例的电子书语音播放装置用于实现前述多个方法实施例中相应的电子书语音播放方法,并具有相应的方法实施例的有益效果,在此不再赘述。The e-book voice playback device of the present embodiment is used to implement the corresponding e-book voice playback method in the foregoing multiple method embodiments, and has the beneficial effects of the corresponding method embodiments, and details are not described herein again.
实施例五Embodiment 5
参照图5,示出了根据本发明实施例五的一种终端设备的结构示意图,本发明具体实施例并不对终端设备的具体实现做限定。Referring to FIG. 5, a schematic structural diagram of a terminal device according to Embodiment 5 of the present invention is shown. The specific implementation of the present invention does not limit the specific implementation of the terminal device.
如图5所示,该终端设备可以包括:处理器(processor)502、通信接口(Communications Interface)504、存储器(memory)506、以及通信总线508。As shown in FIG. 5, the terminal device may include a processor 502, a communications interface 504, a memory 506, and a communication bus 508.
其中:among them:
处理器502、通信接口504、以及存储器506通过通信总线508完成相互间的通信。Processor 502, communication interface 504, and memory 506 complete communication with one another via communication bus 508.
通信接口504,用于与其它设备比如其它终端设备或服务器等的网元通信。The communication interface 504 is configured to communicate with network elements of other devices, such as other terminal devices or servers.
处理器502,用于执行程序510,具体可以执行上述电子书语音播放方法实施例中的相关步骤。The processor 502 is configured to execute the program 510, and specifically, the related steps in the foregoing embodiment of the electronic book voice playing method.
具体地,程序510可以包括程序代码,该程序代码包括计算机操作指令。In particular, program 510 can include program code, the program code including computer operating instructions.
处理器502可能是中央处理器CPU,或者是特定集成电路ASIC(Application Specific Integrated Circuit),或者是被配置成实施本发明实施例的一个或多个集成电路。终端设备包括的一个或多个处理器,可以是同一类型的处理器,如一个或多个CPU;也可以是不同类型的处理器,如一个或多个CPU以及一个或多个ASIC。The processor 502 may be a central processing unit CPU, or an Application Specific Integrated Circuit (ASIC), or one or more integrated circuits configured to implement embodiments of the present invention. The one or more processors included in the terminal device may be the same type of processor, such as one or more CPUs; or may be different types of processors, such as one or more CPUs and one or more ASICs.
存储器506,用于存放程序510。存储器506可能包含高速RAM存 储器,也可能还包括非易失性存储器(non-volatile memory),例如至少一个磁盘存储器。The memory 506 is configured to store the program 510. Memory 506 may include high speed RAM memory and may also include non-volatile memory, such as at least one disk memory.
程序510具体可以用于使得处理器502执行以下操作:根据指示对电子书进行语音播放的语音播放指令,确定待语音播放的电子书内容;获得与所述电子书内容相对应的真实人声音频,并播放所述真实人声音频。The program 510 may be specifically configured to cause the processor 502 to: determine the e-book content to be played by the voice according to the voice play instruction indicating the voice play of the e-book; and obtain the real vocal audio corresponding to the e-book content. And play the real vocal audio.
在一种可选的实施方式中,真实人声音频包括以下至少之一:从与电子书对应的影视剧中获取的影视台词音频;与电子书的电子书内容对应的朗读音频;电子书所在的电子书阅读应用的用户录制的用户音频。In an optional implementation manner, the real vocal audio includes at least one of: audio and video audio obtained from a movie drama corresponding to the electronic book; reading audio corresponding to the electronic book content of the electronic book; User audio recorded by the user of the e-book reading application.
在一种可选的实施方式中,程序510还用于使得处理器502在获得与待播放的电子书内容相对应的真实人声音频,并播放所述真实人声音频时,获得与待播放的电子书内容相对应的合成音频,其中,所述合成音频除包括所述真实人声音频外,还包括背景音频和/或业务音频;播放所述合成音频。In an optional implementation manner, the program 510 is further configured to enable the processor 502 to obtain and play the real vocal audio corresponding to the e-book content to be played, and play the real vocal audio. The synthesized audio corresponding to the e-book content, wherein the synthesized audio includes background audio and/or service audio in addition to the real vocal audio; playing the synthesized audio.
在一种可选的实施方式中,电子书中预设有用于标记电子书内容的至少一个内容标记,真实人声音频中预设有用于标记音频内容的至少一个音频标记;程序510还用于使得处理器502在获得与所述电子书内容相对应的真实人声音频时,根据所述内容标记与所述音频标记之间的对应关系,获得与所述电子书内容相对应的真实人声音频。In an optional implementation, at least one content tag for marking the content of the e-book is pre-set in the e-book, and at least one audio tag for marking the audio content is pre-set in the real vocal audio; the program 510 is also used to When the processor 502 obtains the real vocal audio corresponding to the e-book content, according to the correspondence between the content tag and the audio tag, obtaining a real vocal corresponding to the e-book content Audio.
在一种可选的实施方式中,程序510还用于使得处理器502在根据所述内容标记与所述音频标记之间的对应关系,获得与所述电子书内容相对应的真实人声音频时,确定与待语音播放的电子书内容对应的内容标记;根据预存的内容标记与音频标记的对应关系,确定与所述内容标记对应的音频标记;获取与确定的所述音频标记相对应的音频内容。In an optional implementation, the program 510 is further configured to enable the processor 502 to obtain real vocal audio corresponding to the e-book content according to the correspondence between the content tag and the audio tag. Determining a content tag corresponding to the content of the e-book to be played by the voice; determining an audio tag corresponding to the content tag according to the correspondence between the pre-stored content tag and the audio tag; acquiring the audio tag corresponding to the determined Audio content.
在一种可选的实施方式中,程序510还用于使得处理器502在根据指示对电子书进行语音播放的语音播放指令,确定待语音播放的电子书内容之前,对真实人声音频进行语音识别,获得对应的文字内容;确定所述电子书中与所述文字内容相匹配的电子书内容;建立并存储所述文字内容对应的真实人声音频与确定的所述电子书内容之间的对应关系;程序510还用于使得处理器502在获得与待播放的电子书内容相对应的真实人声音 频时,根据所述对应关系,获得与待播放的电子书内容相对应的真实人声音频。In an optional implementation manner, the program 510 is further configured to cause the processor 502 to perform voice on the real vocal audio before determining the e-book content to be played by the voice according to the voice play instruction for performing the voice play on the electronic book according to the indication. Identifying, obtaining corresponding text content; determining e-book content in the e-book that matches the text content; establishing and storing between the real vocal audio corresponding to the text content and the determined content of the e-book Corresponding relationship; the program 510 is further configured to: when the processor 502 obtains the real vocal audio corresponding to the e-book content to be played, obtain the real vocal corresponding to the e-book content to be played according to the correspondence relationship Audio.
在一种可选的实施方式中,当真实人声音频包括影视台词音频、电子书内容朗读音频、和用户音频中的至少两个时,程序510还用于使得处理器502在获得与所述电子书内容相对应的真实人声音频时,按照预设的优先级,从影视台词音频、电子书内容朗读音频、和用户音频中的至少两个中,获得与所述电子书内容相对应的真实人声音频;或者,接收用户对影视台词音频、电子书内容朗读音频、和用户音频中的至少两个对应的选项的选择操作,获得所述选择操作所选择的、与所述电子书内容相对应的真实人声音频或者,根据用户播放真实人声音频的历史数据,确定用户的音频类型偏好;根据用户的音频类型偏好,从影视台词音频、电子书内容朗读音频、和用户音频中的至少两个中,获得与待播放的电子书内容相对应的真实人声音频。In an optional implementation, when the real vocal audio includes at least two of the audio-visual audio, the e-book content reading audio, and the user audio, the program 510 is further configured to cause the processor 502 to obtain and When the real vocal audio corresponding to the e-book content corresponds to at least two of the audio-visual line audio, the e-book content reading audio, and the user audio, the content corresponding to the e-book content is obtained according to a preset priority. Real vocal audio; or, receiving a user's selection operation of at least two corresponding options of the audio-visual line audio, the e-book content reading audio, and the user audio, and obtaining the e-book content selected by the selection operation Corresponding real vocal audio or, according to the historical data of the user playing real vocal audio, determining the user's audio type preference; according to the user's audio type preference, from the audio and video audio, the e-book content reading audio, and the user audio In at least two, real vocal audio corresponding to the content of the e-book to be played is obtained.
在一种可选的实施方式中,程序510还用于使得处理器502在接收用户对影视台词音频、电子书内容朗读音频、和用户音频中的至少两个对应的选项的选择操作之前,通过弹窗或者透明覆盖层显示影视台词音频、电子书内容朗读音频、和用户音频中的至少两个对应的选项。In an optional implementation, the program 510 is further configured to cause the processor 502 to pass the user's selection operation of the at least two corresponding options of the station audio, the e-book content reading audio, and the user audio. The pop-up window or transparent overlay displays options corresponding to at least two of the audio and video audio, the e-book content reading audio, and the user audio.
在一种可选的实施方式中,程序510还用于使得处理器502在根据指示对电子书进行语音播放的语音播放指令,确定待语音播放的电子书内容时,根据指示对电子书进行语音播放的语音播放指令和对电子书的显示内容的选择操作,确定待语音播放的电子书内容。In an optional implementation manner, the program 510 is further configured to: when the processor 502 determines the e-book content to be played by the voice according to the voice play instruction for performing the voice play on the electronic book according to the instruction, perform voice on the e-book according to the indication. The played voice play command and the selection operation of the display content of the e-book determine the content of the e-book to be played by the voice.
在一种可选的实施方式中,程序510还用于使得处理器502在根据指示对电子书进行语音播放的语音播放指令和对电子书的显示内容的选择操作,确定待语音播放的电子书内容之前,接收对电子书的显示内容的选择操作,根据所述选择操作确定待语音播放的电子书内容。In an optional implementation manner, the program 510 is further configured to: determine, by the processor 502, a voice play instruction for performing voice play on the electronic book according to the indication and a selection operation on the display content of the electronic book, and determine an electronic book to be played by the voice. Before the content, a selection operation of the display content of the electronic book is received, and the electronic book content to be played by the voice is determined according to the selection operation.
在一种可选的实施方式中,程序510还用于使得处理器502在接收对电子书的显示内容的选择操作,根据所述选择操作确定待语音播放的电子书内容时,接收用户对电子书的显示内容的第一操作,确定第一操作在所述显示内容中的第一作用点;接收用户对所述显示内容的第二操作,确定 第二操作在所述显示内容中的第二作用点;将第一作用点和第二作用点之间的显示内容确定为待语音播放的电子书内容。In an optional implementation manner, the program 510 is further configured to: when the processor 502 receives the selection operation of the display content of the electronic book, and determines the electronic book content to be played by the voice according to the selecting operation, receiving the user to the electronic a first operation of displaying content of the book, determining a first action point of the first operation in the display content; receiving a second operation of the display content by the user, determining a second operation of the second operation in the display content The action point; the display content between the first action point and the second action point is determined as the content of the e-book to be played by the voice.
在一种可选的实施方式中,程序510还用于使得处理器502在接收对电子书的显示内容的选择操作,根据所述选择操作确定待语音播放的电子书内容时,接收用户对电子书的显示内容的第三操作,确定第三操作在所述显示内容中的第三作用点;以第三作用点为参考点,将包括第三作用点在内的第一设定范围内的显示内容确定为待语音播放的电子书内容;或者,将以第三作用点为起点的第二设定范围内的显示内容确定为待语音播放的电子书内容;或者,将以第三作用点为终点的第三设定范围内的显示内容确定为待语音播放的电子书内容。In an optional implementation manner, the program 510 is further configured to: when the processor 502 receives the selection operation of the display content of the electronic book, and determines the electronic book content to be played by the voice according to the selecting operation, receiving the user to the electronic a third operation of displaying the content of the book, determining a third action point of the third operation in the display content; using the third action point as a reference point, the first set range including the third action point The display content is determined as the e-book content to be played by the voice; or the display content in the second setting range starting from the third action point is determined as the e-book content to be played by the voice; or, the third action point is to be The display content in the third setting range of the end point is determined as the content of the e-book to be played by the voice.
在一种可选的实施方式中,程序510还用于使得处理器502在接收对电子书的显示内容的选择操作,根据所述选择操作确定待语音播放的电子书内容时,接收用户对电子书的显示内容的选择操作,确定所述选择操作所选择的显示内容对应的内容标记;将所述内容标记所标记的内容确定为待语音播放的电子书内容。In an optional implementation manner, the program 510 is further configured to: when the processor 502 receives the selection operation of the display content of the electronic book, and determines the electronic book content to be played by the voice according to the selecting operation, receiving the user to the electronic a selection operation of the display content of the book, determining a content tag corresponding to the display content selected by the selection operation; and determining the content marked by the content tag as the e-book content to be played by the voice.
在一种可选的实施方式中,程序510还用于使得处理器502在根据电子书的语音播放指令,确定待语音播放的电子书内容之前,接收用户通过电子书阅读应用为电子书的内容录制的朗读音频,将录制的音频和对应的电子书的内容关联存储;和/或,接收用户通过电子书阅读应用为电子书的内容录制的评论音频,将评论音频和对应的电子书的内容关联存储。In an optional implementation manner, the program 510 is further configured to: when the processor 502 determines the e-book content to be played by the voice according to the voice play instruction of the e-book, receive the content that the user reads the application into the e-book through the e-book. Recording aloud audio, storing the recorded audio in association with the content of the corresponding e-book; and/or receiving the comment audio recorded by the user through the e-book reading application for the content of the e-book, and the content of the comment audio and the corresponding e-book Associate storage.
在一种可选的实施方式中,程序510还用于使得处理器502接收对播放的真实人声音频的音频处理指令,对真实人声音频进行所述音频处理指令所指示的操作。In an alternative embodiment, the program 510 is further configured to cause the processor 502 to receive an audio processing instruction for the played real vocal audio, the real vocal audio being subjected to the operation indicated by the audio processing instruction.
在一种可选的实施方式中,音频处理指令包括以下至少之一:用于指示暂停真实人声音频播放的暂停指令、用于指示调整真实人声音频的播放速度的第一调整指令、用于指示调整真实人声音频的播放进度的第二调整指令、用于指示退出真实人声音频播放的退出指令、用于指示切换真实人声音频的类型的切换指令。In an optional implementation manner, the audio processing instruction includes at least one of: a pause instruction for instructing suspension of real human voice audio playback, a first adjustment instruction for indicating a playback speed of adjusting real human voice audio, a second adjustment instruction for instructing adjustment of the playback progress of the real vocal audio, an exit instruction for indicating the exit of the real vocal audio playback, and a switching instruction for indicating the type of switching the real vocal audio.
在一种可选的实施方式中,程序510还用于使得处理器502通过悬浮 图标或悬浮窗口或透明覆盖层,显示所述音频处理指令。In an alternative embodiment, the program 510 is further configured to cause the processor 502 to display the audio processing instructions via a floating icon or a floating window or a transparent overlay.
在一种可选的实施方式中,程序510还用于使得处理器502在播放真实人声音频的过程中,接收到对电子书的翻页操作,暂停真实人声音频的播放;根据所述翻页操作重新确定待语音播放的电子书内容;获得与重新确定的电子书内容相对应的真实人声音频并播放。In an optional implementation manner, the program 510 is further configured to enable the processor 502 to receive a page turning operation on the e-book during the playing of the real human voice audio, and suspend the playing of the real human voice audio; The page turning operation redetermines the content of the e-book to be played by the voice; the real vocal audio corresponding to the content of the re-determined e-book is obtained and played.
程序510中各步骤的具体实现可以参见上述电子书语音播放方法实施例中的相应步骤和单元中对应的描述,在此不赘述。所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的设备和模块的具体工作过程,可以参考前述方法实施例中的对应过程描述,在此不再赘述。For the specific implementation of the steps in the program 510, reference may be made to the corresponding steps in the foregoing embodiment of the e-book voice playing method and the corresponding description in the unit, and details are not described herein. A person skilled in the art can clearly understand that, for the convenience and brevity of the description, the specific working process of the device and the module described above may be referred to the corresponding process description in the foregoing method embodiment, and details are not described herein again.
通过本实施例,在用户在眼睛疲劳或者光线不好的情况下,可以通过语音播放指令进行相应电子书内容的语音播放,实现了电子书阅读应用的“听书”功能。并且,本发明实施例中,使用真实人声音频,相比较于机器合成的音频,真实人声音频因为通过真实人声录制,其在语音语调以及流畅性方面都远优于机器合成,使得用户能够获得较好的“听书”体验。Through the embodiment, in the case that the user is tired or the light is not good, the voice play of the corresponding e-book content can be performed by the voice play instruction, and the “listening to book” function of the e-book reading application is realized. Moreover, in the embodiment of the present invention, real vocal audio is used, and compared with the machine-synthesized audio, the real vocal audio is far superior to the machine synthesis in terms of voice intonation and fluency because of recording through real vocals, so that the user Can get a better "listening" experience.
需要指出,根据实施的需要,可将本发明实施例中描述的各个部件/步骤拆分为更多部件/步骤,也可将两个或多个部件/步骤或者部件/步骤的部分操作组合成新的部件/步骤,以实现本发明实施例的目的。It should be noted that the various components/steps described in the embodiments of the present invention may be split into more components/steps according to the needs of the implementation, or two or more components/steps or partial operations of the components/steps may be combined into one. New components/steps to achieve the objectives of embodiments of the present invention.
上述根据本发明实施例的方法可在硬件、固件中实现,或者被实现为可存储在记录介质(诸如CD ROM、RAM、软盘、硬盘或磁光盘)中的软件或计算机代码,或者被实现通过网络下载的原始存储在远程记录介质或非暂时机器可读介质中并将被存储在本地记录介质中的计算机代码,从而在此描述的方法可被存储在使用通用计算机、专用处理器或者可编程或专用硬件(诸如ASIC或FPGA)的记录介质上的这样的软件处理。可以理解,计算机、处理器、微处理器控制器或可编程硬件包括可存储或接收软件或计算机代码的存储组件(例如,RAM、ROM、闪存等),当所述软件或计算机代码被计算机、处理器或硬件访问且执行时,实现在此描述的电子书语音播放方法。此外,当通用计算机访问用于实现在此示出的电子书语音播放方法的代码时,代码的执行将通用计算机转换为用于执行在此示出的电子书语音播放方法的专用计算机。The above method according to an embodiment of the present invention may be implemented in hardware, firmware, or implemented as software or computer code that may be stored in a recording medium such as a CD ROM, a RAM, a floppy disk, a hard disk, or a magneto-optical disk, or implemented by The network downloads computer code originally stored in a remote recording medium or non-transitory machine readable medium and stored in a local recording medium so that the methods described herein can be stored using a general purpose computer, a dedicated processor or programmable Such software processing on a recording medium of dedicated hardware such as an ASIC or an FPGA. It will be understood that a computer, processor, microprocessor controller or programmable hardware includes storage components (eg, RAM, ROM, flash memory, etc.) that can store or receive software or computer code, when the software or computer code is The e-book voice playback method described herein is implemented when the processor or hardware accesses and executes. Moreover, when a general purpose computer accesses code for implementing the e-book voice playback method shown herein, execution of the code converts the general purpose computer into a special purpose computer for executing the electronic book voice playback method shown herein.
本领域普通技术人员可以意识到,结合本文中所公开的实施例描述的各示例的单元及方法步骤,能够以电子硬件、或者计算机软件和电子硬件的结合来实现。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本发明实施例的范围。Those of ordinary skill in the art will appreciate that the elements and method steps of the various examples described in connection with the embodiments disclosed herein can be implemented in electronic hardware or a combination of computer software and electronic hardware. Whether these functions are performed in hardware or software depends on the specific application and design constraints of the solution. A person skilled in the art can use different methods to implement the described functions for each particular application, but such implementation should not be considered to be beyond the scope of the embodiments of the invention.
以上实施方式仅用于说明本发明实施例,而并非对本发明实施例的限制,有关技术领域的普通技术人员,在不脱离本发明实施例的精神和范围的情况下,还可以做出各种变化和变型,因此所有等同的技术方案也属于本发明实施例的范畴,本发明实施例的专利保护范围应由权利要求限定。The above embodiments are only used to illustrate the embodiments of the present invention, and are not intended to limit the embodiments of the present invention, and those skilled in the art can also make various kinds without departing from the spirit and scope of the embodiments of the present invention. Variations and modifications, therefore, all equivalent technical solutions are also within the scope of the embodiments of the present invention, and the scope of patent protection of the embodiments of the present invention should be defined by the claims.
Claims (37)
- 一种电子书语音播放方法,包括:An electronic book voice playing method includes:根据用于指示电子书进行语音播放的语音播放指令,确定待语音播放的电子书内容;Determining the content of the e-book to be played by the voice according to the voice play instruction for instructing the e-book to perform voice playback;获得与所述电子书内容相对应的真实人声音频,并播放所述真实人声音频。Real vocal audio corresponding to the content of the e-book is obtained, and the real vocal audio is played.
- 根据权利要求1所述的方法,其中,所述真实人声音频包括以下至少之一:The method of claim 1 wherein said real vocal audio comprises at least one of:从与所述电子书对应的影视剧中获取的影视台词音频;a film and television audio obtained from a film and television drama corresponding to the e-book;与所述电子书的电子书内容对应的朗读音频;Reading audio corresponding to the e-book content of the e-book;所述电子书所在的电子书阅读应用的用户录制的用户音频。The user audio recorded by the user of the e-book reading application in which the e-book is located.
- 根据权利要求1所述的方法,其中,获得与所述电子书内容相对应的真实人声音频,并播放所述真实人声音频,包括:The method of claim 1, wherein obtaining real vocal audio corresponding to the e-book content and playing the real vocal audio comprises:获得与所述电子书内容相对应的合成音频,其中,所述合成音频包括所述真实人声音频,且还包括背景音频和/或业务音频;Obtaining synthesized audio corresponding to the e-book content, wherein the synthesized audio includes the real vocal audio, and further comprising background audio and/or service audio;播放所述合成音频。Playing the synthesized audio.
- 根据权利要求1-3任一项所述的方法,其中,所述电子书中预设有用于标记电子书内容的至少一个内容标记,所述真实人声音频中预设有用于标记音频内容的至少一个音频标记;The method according to any one of claims 1 to 3, wherein at least one content mark for marking the content of the electronic book is pre-set in the electronic book, and the real vocal audio is pre-set with a mark for the audio content. At least one audio tag;所述获得与所述电子书内容相对应的真实人声音频,包括:The obtaining real vocal audio corresponding to the content of the e-book includes:根据所述内容标记与所述音频标记之间的对应关系,获得与所述电子书内容相对应的真实人声音频。Real vocal audio corresponding to the e-book content is obtained according to a correspondence between the content tag and the audio tag.
- 根据权利要求4所述的方法,其中,根据所述内容标记与所述音频标记之间的对应关系,获得与所述电子书内容相对应的真实人声音频,包括:The method according to claim 4, wherein the real vocal audio corresponding to the e-book content is obtained according to the correspondence between the content tag and the audio tag, comprising:确定与所述待语音播放的电子书内容对应的内容标记;Determining a content tag corresponding to the e-book content to be played by the voice;根据预存的内容标记与音频标记的对应关系,确定与所述内容标记对应的音频标记;Determining an audio tag corresponding to the content tag according to a correspondence between the pre-stored content tag and the audio tag;获取与确定的所述音频标记相对应的音频内容。Acquiring audio content corresponding to the determined audio tag.
- 根据权利要求1-3任一项所述的方法,其中,A method according to any one of claims 1 to 3, wherein在所述根据用于指示电子书进行语音播放的语音播放指令,确定待语音播放的电子书内容之前,所述方法还包括:Before the determining the e-book content to be played by the voice according to the voice play instruction for instructing the e-book to perform the voice play, the method further includes:对真实人声音频进行语音识别,获得对应的文字内容;Perform speech recognition on real vocal audio to obtain corresponding text content;确定所述电子书中与所述文字内容相匹配的电子书内容;Determining an e-book content in the e-book that matches the text content;建立并存储所述文字内容对应的真实人声音频与确定的所述电子书内容之间的对应关系;Establishing and storing a correspondence between the real vocal audio corresponding to the text content and the determined content of the e-book;所述获得与所述电子书内容相对应的真实人声音频,包括:根据所述文字内容对应的真实人声音频与确定的所述电子书内容之间的对应关系,获得与所述电子书内容相对应的真实人声音频。The obtaining the real vocal audio corresponding to the e-book content includes: obtaining the e-book according to the correspondence between the real vocal audio corresponding to the text content and the determined content of the e-book content The real vocal audio corresponding to the content.
- 根据权利要求2所述的方法,其中,当所述真实人声音频包括所述影视台词音频、所述电子书内容朗读音频和所述用户音频中的至少两个时,The method according to claim 2, wherein when said real vocal audio includes at least two of said movie audio, said electronic book content reading audio, and said user audio,所述获得与所述电子书内容相对应的真实人声音频,包括:The obtaining real vocal audio corresponding to the content of the e-book includes:按照预设的优先级,从所述影视台词音频、所述电子书内容朗读音频和所述用户音频中的至少两个中,获得与所述电子书内容相对应的真实人声音频;Obtaining real vocal audio corresponding to the e-book content from at least two of the film and television word audio, the e-book content reading audio, and the user audio according to a preset priority;或者,or,接收用户对所述影视台词音频、所述电子书内容朗读音频和所述用户音频中的至少两个对应的选项的选择操作,获得所述选择操作所选择的与所述电子书内容相对应的真实人声音频;Receiving, by the user, a selection operation of at least two corresponding options of the audio-visual line audio, the e-book content reading audio, and the user audio, obtaining a selection corresponding to the e-book content selected by the selecting operation Real vocal audio;或者,or,根据用户播放真实人声音频的历史数据,确定用户的音频类型偏好;根据所述用户的音频类型偏好,从所述影视台词音频、所述电子书内容朗读音频和所述用户音频中的至少两个中,获得与所述电子书内容相对应的真实人声音频。Determining a user's audio type preference according to the historical data of the user playing the real vocal audio; at least two of the audiovisual vocabulary audio, the e-book content reading audio, and the user audio according to the user's audio type preference Among them, real vocal audio corresponding to the content of the e-book is obtained.
- 根据权利要求7所述的方法,其中,在所述接收用户对所述影视台词音频、所述电子书内容朗读音频和所述用户音频中的至少两个对应的选 项的选择操作之前,所述方法还包括:The method of claim 7, wherein prior to said selecting operation of said user for at least two of said television station word audio, said e-book content reading audio, and said user audio, said The method also includes:通过弹窗或者透明覆盖层显示所述影视台词音频、所述电子书内容朗读音频和所述用户音频中的至少两个对应的选项。At least two corresponding options of the audio-visual line audio, the e-book content reading audio, and the user audio are displayed through a pop-up window or a transparent overlay.
- 根据权利要求1所述的方法,其中,所述根据用于指示电子书进行语音播放的语音播放指令,确定待语音播放的电子书内容,包括:The method of claim 1, wherein the determining the e-book content to be played by the voice according to the voice play instruction for instructing the e-book to perform the voice play comprises:根据用于指示电子书进行语音播放的语音播放指令和对所述电子书的显示内容的选择操作,确定待语音播放的电子书内容。The e-book content to be played by the voice is determined according to a voice play instruction for instructing the e-book to perform voice play and a selection operation of the display content of the e-book.
- 根据权利要求9所述的方法,其中,在所述根据用于指示电子书进行语音播放的语音播放指令和对所述电子书的显示内容的选择操作,确定待语音播放的电子书内容之前,所述方法还包括:The method according to claim 9, wherein before determining the e-book content to be voice-played based on the voice play instruction for instructing the e-book to perform voice playback and the selection operation on the display content of the e-book, The method further includes:接收对所述电子书的显示内容的选择操作,根据所述选择操作确定待语音播放的电子书内容。Receiving a selection operation of the display content of the electronic book, and determining an electronic book content to be played by the voice according to the selection operation.
- 根据权利要求10所述的方法,其中,所述接收对所述电子书的显示内容的选择操作,根据所述选择操作确定待语音播放的电子书内容,包括:The method according to claim 10, wherein the receiving a selection operation of the display content of the electronic book, and determining the content of the electronic book to be played by the voice according to the selecting operation comprises:接收用户对所述电子书的显示内容的第一操作,确定所述第一操作在所述显示内容中的第一作用点;Receiving a first operation of the display content of the electronic book by the user, determining a first action point of the first operation in the display content;接收用户对所述显示内容的第二操作,确定所述第二操作在所述显示内容中的第二作用点;Receiving a second operation of the display content by the user, determining a second action point of the second operation in the display content;将所述第一作用点和所述第二作用点之间的显示内容确定为待语音播放的电子书内容。The display content between the first action point and the second action point is determined as the e-book content to be played by the voice.
- 根据权利要求10所述的方法,其中,所述接收对所述电子书的显示内容的选择操作,根据所述选择操作确定待语音播放的电子书内容,包括:The method according to claim 10, wherein the receiving a selection operation of the display content of the electronic book, and determining the content of the electronic book to be played by the voice according to the selecting operation comprises:接收用户对所述电子书的显示内容的第三操作,确定所述第三操作在所述显示内容中的第三作用点;Receiving a third operation of the display content of the e-book by the user, determining a third action point of the third operation in the display content;以所述第三作用点为参考点,将包括所述第三作用点在内的第一设定范围内的显示内容确定为待语音播放的电子书内容;或者,将以所述第三作用点为起点的第二设定范围内的显示内容确定为待语音播放的电子书 内容;或者,将以所述第三作用点为终点的第三设定范围内的显示内容确定为待语音播放的电子书内容。Taking the third action point as a reference point, determining the display content in the first setting range including the third action point as the content of the electronic book to be played by the voice; or, The display content in the second setting range whose point is the starting point is determined as the electronic book content to be played by the voice; or the display content in the third setting range ending in the third acting point is determined to be the voice to be played. E-book content.
- 根据权利要求10所述的方法,其中,所述接收对所述电子书的显示内容的选择操作,根据所述选择操作确定待语音播放的电子书内容,包括:The method according to claim 10, wherein the receiving a selection operation of the display content of the electronic book, and determining the content of the electronic book to be played by the voice according to the selecting operation comprises:接收用户对所述电子书的显示内容的选择操作,确定所述选择操作所选择的显示内容对应的内容标记;Receiving a selection operation of the display content of the e-book by the user, and determining a content tag corresponding to the display content selected by the selection operation;将所述内容标记所标记的内容确定为待语音播放的电子书内容。The content marked by the content tag is determined as the e-book content to be played by the voice.
- 根据权利要求1所述的方法,其中,在所述根据用于指示电子书进行语音播放的语音播放指令,确定待语音播放的电子书内容之前,所述方法还包括:The method according to claim 1, wherein before the determining the e-book content to be played by the voice according to the voice play instruction for instructing the e-book to perform voice playback, the method further comprises:接收用户通过电子书阅读应用为所述电子书的内容录制的朗读音频,将录制的所述音频和对应的所述电子书的内容关联存储;Receiving, by the e-book reading application, the spoken audio recorded for the content of the e-book, and storing the recorded audio and the content of the corresponding e-book in association;和/或,and / or,接收用户通过电子书阅读应用为所述电子书的内容录制的评论音频,将所述评论音频和对应的所述电子书的内容关联存储。Receiving, by the e-book reading application, the comment audio recorded for the content of the e-book, and storing the comment audio and the content of the corresponding e-book in association.
- 根据权利要求1所述的方法,其中,所述方法还包括:The method of claim 1 wherein the method further comprises:接收对播放的所述真实人声音频的音频处理指令,对所述真实人声音频进行所述音频处理指令所指示的操作。Receiving an audio processing instruction for the played real vocal audio, performing an operation indicated by the audio processing instruction on the real vocal audio.
- 根据权利要求15所述的方法,其中,所述音频处理指令包括以下至少之一:用于指示暂停所述真实人声音频播放的暂停指令、用于指示调整所述真实人声音频的播放速度的第一调整指令、用于指示调整所述真实人声音频的播放进度的第二调整指令、用于指示退出所述真实人声音频播放的退出指令、用于指示切换所述真实人声音频的类型的切换指令。The method of claim 15, wherein the audio processing instruction comprises at least one of: a pause command for instructing suspension of the real human voice audio playback, for indicating adjustment of a playback speed of the real human voice audio a first adjustment instruction, a second adjustment instruction for instructing adjustment of a playback progress of the real vocal audio, an exit instruction for instructing to exit the real vocal audio play, for indicating switching the real vocal audio The type of switching instruction.
- 根据权利要求15或16所述的方法,其中,所述方法还包括:The method of claim 15 or 16, wherein the method further comprises:通过悬浮图标或悬浮窗口或透明覆盖层,显示所述音频处理指令。The audio processing instructions are displayed by a hovering icon or a floating window or a transparent overlay.
- 根据权利要求1所述的方法,其中,所述方法还包括:The method of claim 1 wherein the method further comprises:在播放所述真实人声音频的过程中,接收到对所述电子书的翻页操作,暂停所述真实人声音频的播放;In the process of playing the real vocal audio, receiving a page turning operation on the electronic book, suspending playing of the real vocal audio;根据所述翻页操作重新确定待语音播放的电子书内容;Re-determining the content of the e-book to be played by the voice according to the page turning operation;获得与重新确定的所述电子书内容相对应的真实人声音频并播放。Real vocal audio corresponding to the re-determined content of the e-book is obtained and played.
- 一种电子书语音播放装置,包括:An electronic book voice playing device, comprising:内容确定模块,用于根据用于指示电子书进行语音播放的语音播放指令,确定待语音播放的电子书内容;a content determining module, configured to determine an e-book content to be played by the voice according to a voice playing instruction for instructing the e-book to perform voice playing;音频播放模块,用于获得与所述电子书内容相对应的真实人声音频,并播放所述真实人声音频。And an audio playing module, configured to obtain real vocal audio corresponding to the e-book content, and play the real vocal audio.
- 根据权利要求19所述的装置,其中,所述真实人声音频包括以下至少之一:The apparatus of claim 19, wherein the real vocal audio comprises at least one of:从与所述电子书对应的影视剧中获取的影视台词音频;a film and television audio obtained from a film and television drama corresponding to the e-book;与所述电子书的电子书内容对应的朗读音频;Reading audio corresponding to the e-book content of the e-book;所述电子书所在的电子书阅读应用的用户录制的用户音频。The user audio recorded by the user of the e-book reading application in which the e-book is located.
- 根据权利要求19所述的装置,其中,所述音频播放模块,用于获得与所述电子书内容相对应的合成音频,其中,所述合成音频包括所述真实人声音频,且还包括背景音频和/或业务音频;以及,用于播放所述合成音频。The apparatus according to claim 19, wherein said audio playback module is configured to obtain synthesized audio corresponding to said electronic book content, wherein said synthesized audio comprises said real human voice audio, and further comprising a background Audio and/or business audio; and, for playing the synthesized audio.
- 根据权利要求19-21任一项所述的装置,其中,所述电子书中预设有用于标记电子书内容的至少一个内容标记,所述真实人声音频中预设有用于标记音频内容的至少一个音频标记;The apparatus according to any one of claims 19 to 21, wherein at least one content mark for marking the contents of the electronic book is pre-set in the electronic book, and the real vocal audio is preliminarily provided for marking the audio content. At least one audio tag;所述音频播放模块,用于根据所述内容标记与所述音频标记之间的对应关系,获得与所述电子书内容相对应的真实人声音频,并播放所述真实人声音频。The audio playing module is configured to obtain real vocal audio corresponding to the electronic book content according to a correspondence between the content mark and the audio mark, and play the real vocal audio.
- 根据权利要求22所述的装置,其中,所述音频播放模块,用于确定与所述待语音播放的电子书内容对应的内容标记;根据预存的内容标记与音频标记的对应关系,确定与所述内容标记对应的音频标记;获取与确定的所述音频标记相对应的音频内容,并播放所述音频内容。The device according to claim 22, wherein the audio playing module is configured to determine a content mark corresponding to the electronic book content to be voice-played; and determine the corresponding relationship according to the corresponding relationship between the pre-stored content mark and the audio mark Depicting an audio tag corresponding to the content tag; acquiring audio content corresponding to the determined audio tag, and playing the audio content.
- 根据权利要求19-21任一项所述的装置,其中,A device according to any one of claims 19-21, wherein所述装置还包括:建立关系模块,用于在所述内容确定模块根据用于 指示电子书进行语音播放的语音播放指令,确定待语音播放的电子书内容之前,对真实人声音频进行语音识别,获得对应的文字内容;确定所述电子书中与所述文字内容相匹配的电子书内容;建立并存储所述文字内容对应的真实人声音频与确定的所述电子书内容之间的对应关系;The device further includes: a relationship establishing module, configured to perform voice recognition on the real vocal audio before the content determining module determines the content of the electronic book to be played by the voice according to the voice playing instruction for instructing the electronic book to perform voice playing Obtaining corresponding text content; determining e-book content in the e-book that matches the text content; establishing and storing a correspondence between the real vocal audio corresponding to the text content and the determined content of the e-book relationship;所述音频播放模块,用于根据所述文字内容对应的真实人声音频与确定的所述电子书内容之间的对应关系,获得与所述电子书内容相对应的真实人声音频,并播放所述真实人声音频。The audio playing module is configured to obtain real vocal audio corresponding to the e-book content according to a correspondence between the real vocal audio corresponding to the text content and the determined content of the e-book, and play The real vocal audio.
- 根据权利要求20所述的装置,其中,当所述真实人声音频包括所述影视台词音频、所述电子书内容朗读音频和所述用户音频中的至少两个时,The apparatus according to claim 20, wherein when said real vocal audio includes at least two of said movie audio, said electronic book content reading audio, and said user audio,所述音频播放模块,用于按照预设的优先级,从所述影视台词音频、所述电子书内容朗读音频和所述用户音频中的至少两个中,获得与所述电子书内容相对应的真实人声音频,并播放所述真实人声音频;The audio playing module is configured to obtain, according to a preset priority, a content corresponding to the e-book content from at least two of the audio-visual line audio, the e-book content reading audio, and the user audio. Real vocal audio and play the real vocal audio;或者,or,所述音频播放模块,用于接收用户对所述影视台词音频、所述电子书内容朗读音频和所述用户音频中的至少两个对应的选项的选择操作,获得所述选择操作所选择的与所述电子书内容相对应的真实人声音频,并播放所述真实人声音频;The audio playing module is configured to receive a selection operation of a user corresponding to at least two of the audio-visual station audio, the e-book content reading audio, and the user audio, to obtain the selected operation of the selection operation The real vocal audio corresponding to the e-book content, and playing the real vocal audio;或者,or,所述音频播放模块,用于根据用户播放真实人声音频的历史数据,确定用户的音频类型偏好;根据所述用户的音频类型偏好,从所述影视台词音频、所述电子书内容朗读音频和所述用户音频中的至少两个中,获得与所述电子书内容相对应的真实人声音频,并播放所述真实人声音频。The audio playing module is configured to determine a user's audio type preference according to the historical data of the user playing the real vocal audio; and read the audio from the movie and television word audio, the electronic book content according to the user's audio type preference In at least two of the user audios, real vocal audio corresponding to the e-book content is obtained, and the real vocal audio is played.
- 根据权利要求25所述的装置,其中,所述装置还包括:The device of claim 25, wherein the device further comprises:显示模块,用于在所述音频播放模块接收用户对所述影视台词音频、所述电子书内容朗读音频和所述用户音频中的至少两个对应的选项的选择操作之前,通过弹窗或者透明覆盖层显示所述影视台词音频、所述电子书内容朗读音频和所述用户音频中的至少两个对应的选项。a display module, configured to: before the audio playback module receives a user's selection operation of at least two corresponding options of the audio-visual line audio, the e-book content reading audio, and the user audio, by using a pop-up window or transparent The overlay layer displays at least two corresponding options of the audiovisual word audio, the e-book content spoken audio, and the user audio.
- 根据权利要求19所述的装置,其中,所述内容确定模块,用于根 据用于指示电子书进行语音播放的语音播放指令和对所述电子书的显示内容的选择操作,确定待语音播放的电子书内容。The device according to claim 19, wherein the content determining module is configured to determine a voice play to be played according to a voice play instruction for instructing the electronic book to perform voice play and a selection operation of the display content of the electronic book E-book content.
- 根据权利要求27所述的装置,其中,所述装置还包括:The device of claim 27, wherein the device further comprises:内容选择模块,用于在所述内容确定模块根据用于指示电子书进行语音播放的语音播放指令和对所述电子书的显示内容的选择操作,确定待语音播放的电子书内容之前,接收对所述电子书的显示内容的选择操作,根据所述选择操作确定待语音播放的电子书内容。a content selection module, configured to receive, before the content determining module determines a content of the e-book to be voice-played, according to a voice play instruction for instructing the e-book to perform voice playback and a selection operation of displaying content of the e-book The selection operation of the display content of the electronic book determines the content of the electronic book to be played by the voice according to the selection operation.
- 根据权利要求28所述的装置,其中,所述内容选择模块包括:The apparatus of claim 28, wherein the content selection module comprises:第一选择模块,用于接收用户对所述电子书的显示内容的第一操作,确定所述第一操作在所述显示内容中的第一作用点;接收用户对所述显示内容的第二操作,确定所述第二操作在所述显示内容中的第二作用点;将所述第一作用点和所述第二作用点之间的显示内容确定为待语音播放的电子书内容。a first selection module, configured to receive a first operation of the display content of the e-book by the user, determine a first action point of the first operation in the display content, and receive a second user to the display content And determining a second action point of the second operation in the display content; determining display content between the first action point and the second action point as an e-book content to be played by voice.
- 根据权利要求28所述的装置,其中,所述内容选择模块包括:The apparatus of claim 28, wherein the content selection module comprises:第二选择模块,用于接收用户对所述电子书的显示内容的第三操作,确定所述第三操作在所述显示内容中的第三作用点;以所述第三作用点为参考点,将包括所述第三作用点在内的第一设定范围内的显示内容确定为待语音播放的电子书内容;或者,将以所述第三作用点为起点的第二设定范围内的显示内容确定为待语音播放的电子书内容;或者,将以所述第三作用点为终点的第三设定范围内的显示内容确定为待语音播放的电子书内容。a second selection module, configured to receive a third operation of the display content of the e-book by the user, determine a third action point of the third operation in the display content, and use the third action point as a reference point Determining the display content in the first setting range including the third action point as the electronic book content to be played by the voice; or, in the second setting range starting from the third action point The display content is determined as the e-book content to be played by the voice; or the display content in the third setting range ending with the third action point is determined as the e-book content to be played by the voice.
- 根据权利要求28所述的装置,其中,所述内容选择模块包括:The apparatus of claim 28, wherein the content selection module comprises:第三选择模块,用于接收用户对所述电子书的显示内容的选择操作,确定所述选择操作所选择的显示内容对应的内容标记;将所述内容标记所标记的内容确定为待语音播放的电子书内容。a third selection module, configured to receive a user's selection operation on the display content of the electronic book, determine a content tag corresponding to the display content selected by the selection operation, and determine the content marked by the content tag as a voice to be played E-book content.
- 根据权利要求19所述的装置,其中,所述装置还包括:The device of claim 19, wherein the device further comprises:录制存储模块,用于在所述内容确定模块根据用于指示电子书进行语音播放的语音播放指令,确定待语音播放的电子书内容之前,接收用户通过电子书阅读应用为所述电子书的内容录制的朗读音频,将录制的所述音 频和对应的所述电子书的内容关联存储;和/或,接收用户通过电子书阅读应用为所述电子书的内容录制的评论音频,将所述评论音频和对应的所述电子书的内容关联存储。a recording storage module, configured to receive, by the content determining module, the content of the e-book through the e-book reading application before determining the e-book content to be played by the voice according to the voice playing instruction for instructing the e-book to perform voice playing Recording aloud audio, associating the recorded audio with the content of the corresponding e-book; and/or receiving a comment audio recorded by the user through the e-book reading application for the content of the e-book, the comment The audio is stored in association with the content of the corresponding e-book.
- 根据权利要求19所述的装置,其中,所述装置还包括:The device of claim 19, wherein the device further comprises:音频处理模块,用于接收对播放的所述真实人声音频的音频处理指令,对所述真实人声音频进行所述音频处理指令所指示的操作。And an audio processing module, configured to receive an audio processing instruction for the real vocal audio played, and perform an operation indicated by the audio processing instruction on the real vocal audio.
- 根据权利要求33所述的装置,其中,所述音频处理指令包括以下至少之一:用于指示暂停所述真实人声音频播放的暂停指令、用于指示调整所述真实人声音频的播放速度的第一调整指令、用于指示调整所述真实人声音频的播放进度的第二调整指令、用于指示退出所述真实人声音频播放的退出指令、用于指示切换真实人声音频的类型的切换指令。The apparatus according to claim 33, wherein said audio processing instruction comprises at least one of: a pause instruction for instructing suspension of said real human voice audio, for indicating adjustment of a playback speed of said real human voice audio a first adjustment instruction, a second adjustment instruction for instructing adjustment of a playback progress of the real vocal audio, an exit instruction for instructing to exit the real vocal audio play, and a type for indicating switching of the real vocal audio Switching instructions.
- 根据权利要求33或34所述的装置,其中,所述显示模块,还用于通过悬浮图标或悬浮窗口或透明覆盖层,显示所述音频处理指令。The device according to claim 33 or 34, wherein the display module is further configured to display the audio processing instruction by a floating icon or a floating window or a transparent overlay.
- 根据权利要求19所述的装置,其中,所述装置还包括:The device of claim 19, wherein the device further comprises:重确定模块,用于在播放所述真实人声音频的过程中,接收到对所述电子书的翻页操作,暂停所述真实人声音频的播放;根据所述翻页操作重新确定待语音播放的电子书内容;获得与重新确定的所述电子书内容相对应的真实人声音频并播放。a re-determination module, configured to receive a page turning operation on the e-book during a process of playing the real vocal audio, suspending playing of the real vocal audio; and re-determining the to-be-voiced according to the page turning operation The played e-book content; the real vocal audio corresponding to the re-determined e-book content is obtained and played.
- 一种终端设备,包括:处理器、存储器、通信接口和通信总线,所述处理器、所述存储器和所述通信接口通过所述通信总线完成相互间的通信;A terminal device includes: a processor, a memory, a communication interface, and a communication bus, wherein the processor, the memory, and the communication interface complete communication with each other through the communication bus;所述存储器用于存放至少一可执行指令,所述可执行指令使所述处理器执行如权利要求1-18任一项所述的电子书语音播放方法对应的操作。The memory is configured to store at least one executable instruction that causes the processor to perform an operation corresponding to the e-book voice playback method of any of claims 1-18.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710601433.6A CN107369462B (en) | 2017-07-21 | 2017-07-21 | Electronic book voice playing method and device and terminal equipment |
CN201710601433.6 | 2017-07-21 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2019015613A1 true WO2019015613A1 (en) | 2019-01-24 |
Family
ID=60307242
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2018/096162 WO2019015613A1 (en) | 2017-07-21 | 2018-07-18 | Electronic-book voice playback method, apparatus, and terminal device |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN107369462B (en) |
WO (1) | WO2019015613A1 (en) |
Families Citing this family (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107369462B (en) * | 2017-07-21 | 2020-06-26 | 阿里巴巴(中国)有限公司 | Electronic book voice playing method and device and terminal equipment |
CN107992250A (en) * | 2017-12-20 | 2018-05-04 | 维沃移动通信有限公司 | A kind of display methods of electronic book documentary content, mobile terminal |
CN108509605A (en) * | 2018-04-03 | 2018-09-07 | 优视科技有限公司 | A kind of speech playing method of news information, device and terminal device |
CN108874266A (en) * | 2018-06-27 | 2018-11-23 | 北京微播视界科技有限公司 | Text playback method, client, terminal and storage medium |
CN110797001B (en) * | 2018-07-17 | 2022-04-12 | 阿里巴巴(中国)有限公司 | Method and device for generating voice audio of electronic book and readable storage medium |
TWI717627B (en) * | 2018-08-09 | 2021-02-01 | 台灣大哥大股份有限公司 | E-book apparatus with audible narration and method using the same |
CN109189983A (en) * | 2018-09-18 | 2019-01-11 | 王全志 | Speech playing method and device for study |
CN110032355B (en) * | 2018-12-24 | 2022-05-17 | 阿里巴巴集团控股有限公司 | Voice playing method and device, terminal equipment and computer storage medium |
CN109828711A (en) * | 2019-01-25 | 2019-05-31 | 努比亚技术有限公司 | A kind of reading management method, mobile terminal and the storage medium of mobile terminal |
CN111833903B (en) * | 2019-04-22 | 2024-06-18 | 珠海金山办公软件有限公司 | Method and device for executing operation task |
CN111324330B (en) * | 2020-02-07 | 2021-04-30 | 掌阅科技股份有限公司 | Electronic book playing processing method, computing device and computer storage medium |
CN111459446B (en) * | 2020-03-27 | 2021-08-17 | 掌阅科技股份有限公司 | Resource processing method of electronic book, computing equipment and computer storage medium |
CN113779204B (en) * | 2020-06-09 | 2024-06-11 | 浙江未来精灵人工智能科技有限公司 | Data processing method, device, electronic equipment and computer storage medium |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1412687A (en) * | 2001-10-17 | 2003-04-23 | 英业达集团(南京)电子技术有限公司 | Device capable of playing background music and reading electronic book aloud and its method |
CN1653517A (en) * | 2002-05-09 | 2005-08-10 | 汤姆森特许公司 | Text-to-speech converting for hand-held devices |
US20110119590A1 (en) * | 2009-11-18 | 2011-05-19 | Nambirajan Seshadri | System and method for providing a speech controlled personal electronic book system |
CN102576251A (en) * | 2009-09-02 | 2012-07-11 | 亚马逊技术股份有限公司 | Touch-screen user interface |
CN102723004A (en) * | 2011-03-29 | 2012-10-10 | 汉王科技股份有限公司 | Electronic document point-reading control method and apparatus |
CN105869446A (en) * | 2016-03-29 | 2016-08-17 | 广州阿里巴巴文学信息技术有限公司 | Electronic reading apparatus and voice reading loading method |
CN107369462A (en) * | 2017-07-21 | 2017-11-21 | 广州阿里巴巴文学信息技术有限公司 | E-book speech playing method, device and terminal device |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101968969B (en) * | 2010-10-22 | 2015-01-21 | 康佳集团股份有限公司 | Electronic book mobile device and electronic book background music playing method |
CN106960051B (en) * | 2017-03-31 | 2019-12-10 | 掌阅科技股份有限公司 | Audio playing method and device based on electronic book and terminal equipment |
-
2017
- 2017-07-21 CN CN201710601433.6A patent/CN107369462B/en active Active
-
2018
- 2018-07-18 WO PCT/CN2018/096162 patent/WO2019015613A1/en active Application Filing
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1412687A (en) * | 2001-10-17 | 2003-04-23 | 英业达集团(南京)电子技术有限公司 | Device capable of playing background music and reading electronic book aloud and its method |
CN1653517A (en) * | 2002-05-09 | 2005-08-10 | 汤姆森特许公司 | Text-to-speech converting for hand-held devices |
CN102576251A (en) * | 2009-09-02 | 2012-07-11 | 亚马逊技术股份有限公司 | Touch-screen user interface |
US20110119590A1 (en) * | 2009-11-18 | 2011-05-19 | Nambirajan Seshadri | System and method for providing a speech controlled personal electronic book system |
CN102723004A (en) * | 2011-03-29 | 2012-10-10 | 汉王科技股份有限公司 | Electronic document point-reading control method and apparatus |
CN105869446A (en) * | 2016-03-29 | 2016-08-17 | 广州阿里巴巴文学信息技术有限公司 | Electronic reading apparatus and voice reading loading method |
CN107369462A (en) * | 2017-07-21 | 2017-11-21 | 广州阿里巴巴文学信息技术有限公司 | E-book speech playing method, device and terminal device |
Also Published As
Publication number | Publication date |
---|---|
CN107369462A (en) | 2017-11-21 |
CN107369462B (en) | 2020-06-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2019015613A1 (en) | Electronic-book voice playback method, apparatus, and terminal device | |
JP7065740B2 (en) | Application function information display method, device, and terminal device | |
US9031493B2 (en) | Custom narration of electronic books | |
US20210304799A1 (en) | Transcript-based insertion of secondary video content into primary video content | |
US11457061B2 (en) | Creating a cinematic storytelling experience using network-addressable devices | |
US20120276504A1 (en) | Talking Teacher Visualization for Language Learning | |
JP2015517684A (en) | Content customization | |
WO2021121023A1 (en) | Video editing method, video editing apparatus, terminal, and readable storage medium | |
CN112068750A (en) | House resource processing method and device | |
US20170194031A1 (en) | Method and device for generating video slides | |
US20180136828A1 (en) | Interactive management system for performing arts productions | |
US20220047954A1 (en) | Game playing method and system based on a multimedia file | |
CN109634501B (en) | Electronic book annotation adding method, electronic equipment and computer storage medium | |
WO2014154097A1 (en) | Automatic page content reading-aloud method and device thereof | |
US20150106394A1 (en) | Automatically playing audio announcements in music player | |
CN108845741A (en) | A kind of generation method, client, terminal and the storage medium of AR expression | |
US20200143813A1 (en) | Information processing device, information processing method, and computer program | |
KR101789221B1 (en) | Device and method for providing moving picture, and computer program for executing the method | |
WO2018095195A1 (en) | Method and device for customizing packaging box | |
US10123090B2 (en) | Visually representing speech and motion | |
US20200168222A1 (en) | Information processing device, information processing method, and program | |
WO2020026799A1 (en) | Information processing device, information processing method, and program | |
US20160077719A1 (en) | Interactive blocking and management for performing arts productions | |
CN109643539A (en) | Sound processing apparatus and method | |
KR101832464B1 (en) | Device and method for providing moving picture, and computer program for executing the method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 18835085 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 18835085 Country of ref document: EP Kind code of ref document: A1 |