KR20220055644A

KR20220055644A - Digital Audio Book Production System and the Method

Info

Publication number: KR20220055644A
Application number: KR1020200139958A
Authority: KR
Inventors: 이장우
Original assignee: 이장우
Priority date: 2020-10-27
Filing date: 2020-10-27
Publication date: 2022-05-04
Also published as: KR102465504B1; WO2022092565A1

Abstract

The present invention relates to a system and method for producing a digital audiobook. More specifically, it relates to a system and method that convert text data into digital audio data to produce an on-demand digital audiobook, produce a designated-type digital audiobook by selectively applying a tone chosen by the user, and produce a customized digital audiobook from text data uploaded by the user. In accordance with an embodiment of the present invention, the system includes: a digital audiobook server that concludes a contract with a publisher server holding copyrights, collects the publisher's book data and converts it into digital text data, converts the digital text data into raw digital audio data in a digital voice file format, and converts the raw digital audio data into a tone selected at a user terminal and provides it to the user terminal; and a plurality of user terminals that access the digital audiobook server over a wired/wireless network to download and play raw digital audio data, or designated digital audio data obtained by converting the raw digital audio file into a desired tone, and that upload text data to download and play the raw digital audio data or designated digital audio data for the uploaded text data.

Description

Digital Audio Book Production System and the Method

The present invention relates to a digital audiobook production system and method. More specifically, it relates to a system and method that convert text data into digital audio data to produce an on-demand digital audiobook, produce a designated-type digital audiobook by selectively applying a tone chosen by the user, and produce a customized-type digital audiobook from text data uploaded by the user.

Unlike an analog audiobook, which is recorded and stored as voice actors or volunteers read the text aloud (narration), a digital audiobook is a digital book in which the content of a work that has been or can be published as a book, such as text or images, is converted from digital text data into digital audio data without a narration step, stored on an electronic recording medium or storage device, and then read, viewed, or listened to on a computer or mobile terminal over a wired/wireless information and communication network.

Such a digital audiobook is presented visually and audibly through a dedicated reader that displays the text or images of the book, or through a terminal such as a PC equipped with display means. As the technology has developed, it has moved beyond the form in which the user reads directly: the content of the book is output as speech (digital audio data) so that visually impaired people, infants, children, and others can become familiar with books, or it is output as audible data, which is useful in situations such as driving or exercising where visual data cannot be viewed.

In particular, for buyers the digital audiobook offers a lower price than a paper book, time savings through online purchase (download from the e-book publisher's website), the option to buy only the parts that are needed, and the ability to watch video material or listen to background music while reading. For publishers, it yields business benefits through savings in production costs such as printing and binding and in distribution costs, a smaller inventory burden, and easy updating of book content.

However, producing a conventional analog audiobook requires hiring voice actors and carrying out narration and recording in a dedicated studio. This entails excessive production costs and a recording process of at least 7 to 8 hours per book, so a mass-production system can never be established.

In particular, because of these excessive production costs, the production time of the analog approach, and the high barrier posed by audiobook publishers with dedicated studios, the novels, essays, and other literature and the specialized books in technical and professional fields that are published around the world in real time are not made into audiobooks, so that only a very limited number of audiobooks are produced.

Registration No. 10-1789057 (publication date: October 23, 2017)

The present invention has been devised to solve the problems described above. Its object is to provide a digital audiobook production system and method that allow audiobooks to be produced easily, quickly, and in real time at lower cost, so that books from around the world can be generated as digital audiobooks on demand.

To achieve the above object, a digital audiobook production system according to an embodiment of the present invention comprises:

a digital audiobook server that concludes a contract with a publisher server holding copyrights, collects the publisher's book data and converts it into digital text data, converts the digital text data into raw digital audio data in the form of a digital voice file, and converts the raw digital audio data into a tone selected at a user terminal and provides it to the user terminal; and

a plurality of user terminals that access the digital audiobook server over a wired/wireless network to download and play raw digital audio data, or designated digital audio data obtained by selecting a desired tone and converting the raw digital audio file into that tone, and that upload text data to download and play raw digital audio data or designated digital audio data for the uploaded text data.

In addition, when the user encounters an important passage while playing and listening to the digital audio data, the user terminal stores the needed part of the digital audio data as audio and text data by searching or extracting it through a voice command.

In addition, the digital audiobook server translates the collected digital text data, or digital text data uploaded from a user terminal, into various languages and converts the translated digital text data into raw digital audio data or designated digital audio data.

In addition, the digital audiobook server uses artificial intelligence to analyze the overall context of the collected digital text data, or of digital text data uploaded from a user terminal, and generates raw digital audio data in a tone that suits the context.

In addition, the digital audiobook server manages the users of the user terminals by dividing them into free general users and paying customer users, and provides differentiated digital audiobook services to the two groups.

Furthermore, the digital audiobook server uses artificial intelligence to analyze a user's usage data collected through the user terminal and recommends to the user terminal the digital audio data that the user needs or prefers.

A digital audiobook production method according to an embodiment of the present invention comprises: (A) a step in which a user terminal accesses a digital audiobook server over a wired/wireless network and either selects digital text data provided by the digital audiobook server or uploads digital text data stored on the terminal itself;

(B) a step in which, when the digital text file has been selected, the digital audiobook server converts it into raw digital audio data in the form of a digital voice file;

(C) a step in which, when a desired tone has been selected at the user terminal, the digital audiobook server converts the raw digital audio data into the selected tone (that is, into designated digital audio data) and provides it to the user terminal;

(D) a step in which the user terminal plays the provided designated digital audio data to use the digital audiobook service; and

(E) a step in which, when a desired language is selected while uploading the self-stored digital text data in step (A), the digital audiobook server translates the uploaded digital text data into the selected language to generate translated digital text data, after which step (B) and the subsequent steps are performed.

In addition, in step (B), the digital audiobook server uses artificial intelligence to analyze the overall context and designates a tone that matches the context.

The method further comprises: (F) a step of searching (indexing) or extracting (copying) a needed part through a voice command at the user terminal while the designated digital audio data of step (D) is being played;

(G) a step of storing (pasting) the extracted digital audio/text data in the user terminal when the needed part is extracted (copied); and

(G') a step of repeatedly playing the indexed digital audio/text data or storing it in the user terminal when the search (indexing) is performed.

According to the above solution, digital audiobooks can be produced by a system that is very inexpensive and capable of real-time mass production, without relying on separate studios or voice actors, which raise production costs and consume excessive production time. Books of every kind that have been or will be published around the world can thus be listened to as audiobooks cheaply and conveniently through user terminals in environments where reading a book is not possible, providing a platform that can dramatically raise the collective intelligence of all humankind.

FIG. 1 is a configuration diagram of a digital audiobook production system according to an embodiment of the present invention.
FIG. 2 is an internal configuration diagram of the digital audiobook server shown in FIG. 1.
FIG. 3 is a flowchart illustrating a digital audiobook production method according to an embodiment of the present invention.
FIG. 4 is a flowchart illustrating a method of indexing, extracting, and storing through a voice command while digital audio data is being played, according to an embodiment of the present invention.

Hereinafter, the configuration and operation of embodiments of the present invention will be described with reference to the accompanying drawings.

It should be noted that identical components in the drawings are denoted by the same reference numerals and symbols wherever possible, even when they appear in different drawings.

In the following description of the present invention, detailed descriptions of related known functions or configurations will be omitted where they are judged likely to obscure the gist of the present invention unnecessarily.

In addition, when a part is said to "include" a component, this means that it may further include other components, not that other components are excluded, unless specifically stated otherwise.

FIG. 1 is a configuration diagram of a digital audiobook production system according to an embodiment of the present invention, and FIG. 2 is an internal configuration diagram of the digital audiobook server shown in FIG. 1.

As shown in FIG. 1, in the digital audiobook production system according to an embodiment of the present invention, a plurality of user terminals 100a, 100b, …, 100n and a digital audiobook server 300 are connected through a wired/wireless network 200.

The user terminals 100a, 100b, …, 100n are terminals carried by users who access the digital audiobook server 300 through the wired/wireless network 200 to receive the digital audiobook service, and may be, for example, PCs or smartphones.

To this end, the user terminals 100a, 100b, …, 100n can use an on-demand digital audiobook service in which raw digital audio data produced by the digital audiobook server 300 and stored in the raw digital audio DB 312 is downloaded and played, so that a wide variety of books (digital audio data) can be listened to conveniently and freely.

At this time, a desired tone can be selected at the user terminals 100a, 100b, …, 100n, and the raw digital audio data can be converted into that tone by the digital audiobook server 300 and stored in the designated digital audio DB 314.

In addition, when a user wishes to convert text data stored on the user terminal itself, rather than a pre-produced digital audiobook, into digital audio data, the user terminal 100a, 100b, …, 100n uploads the text data to the digital audiobook server 300, which converts it into raw digital audio data and then into a selected tone, thereby producing customized digital audio data.

In addition, when the user encounters an important passage while playing and listening to digital audio data from the digital audiobook server 300, the user terminal 100a, 100b, …, 100n can store the needed part of the digital audio data as audio and text data by searching or extracting it through a voice command.

This removes the inconvenience of a conventional audiobook, where a listener who wants to save an important passage has to stop playback and write it down.

In addition, the user terminals 100a, 100b, …, 100n can have text data translated into a desired language through the text data translation function of the translation module 304 of the digital audiobook server 300, and then have the translated text data converted into digital audio data.

For example, text data written in Korean can be translated into a selected (desired) language, and the translated text data can be produced as digital audio data. In this way any text data can be produced as digital audio data in a desired language, and a user listening to the digital audio data can switch to the desired language, which is useful for language learning.
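As a non-limiting illustration only, switching languages during playback could be handled by keeping sentence-aligned segments for each language and resuming at the same sentence in the target language. The specification does not describe this mechanism; the `Segment` type, the `switch_language` function, and the sample data below are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    index: int      # position of the sentence in the book (shared across languages)
    text: str       # sentence text in this language
    start_s: float  # start time within this language's audio, in seconds


def switch_language(tracks: dict[str, list[Segment]], current_lang: str,
                    position_s: float, target_lang: str) -> float:
    """Return the playback position in target_lang for the sentence now playing."""
    current = tracks[current_lang]
    # The sentence being played is the last one that started at or before position_s.
    playing = max((s for s in current if s.start_s <= position_s),
                  key=lambda s: s.start_s, default=current[0])
    # Assumption: both tracks were produced from aligned sentences, so indexes match.
    by_index = {s.index: s for s in tracks[target_lang]}
    return by_index[playing.index].start_s


# Hypothetical sample data: the same two sentences in Korean and English.
tracks = {
    "ko": [Segment(0, "안녕하세요.", 0.0), Segment(1, "책을 읽습니다.", 2.5)],
    "en": [Segment(0, "Hello.", 0.0), Segment(1, "I am reading a book.", 1.2)],
}
```

Here a listener 3.0 seconds into the Korean track would resume at the start of the second English sentence.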

Furthermore, the user terminals 100a, 100b, …, 100n can upload inaccurate or unsuitable digital audio data to the digital audiobook server 300 so that it is upgraded and accumulated as big data, allowing the digital audio data generation algorithm of the digital audiobook server 300 to be continuously improved.

As shown in FIG. 2, the digital audiobook server 300 includes a licensing module 301, a TTS module 302, a conversion module 303, a translation module 304, and an analysis module 305, and is connected by wire or wirelessly to a publisher server 400 that holds the copyrights to books and to a tone server 500 that provides a variety of tones resembling the narration of voice actors.

The licensing module 301 concludes intellectual property or copyright agreements with the various publisher servers 400, collects the book data of each publisher, converts it into digital text, and stores and manages it in the digital text DB 310.

The TTS module 302 converts the various digital text data stored as text files in the digital text DB 310 into raw digital audio data in the form of digital voice files, and stores and manages it in the raw digital audio DB 312.

Here, the TTS module 302 extracts an optimal prosody model through a TTS algorithm and converts the text into a digital voice file that is close to natural sound and natural tone.

The conversion module 303 takes raw digital audio data stored as digital voice files in the raw digital audio DB 312 and, at the request of a user terminal 100a, 100b, …, 100n, selects a tone from the tone server 500, converts the data into the selected tone, provides it to the user terminal, and stores and manages it in the designated digital audio DB 314.
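As a non-limiting sketch, the handling of a tone request by the conversion module 303 might look like the following. The in-memory dictionaries stand in for the raw digital audio DB 312 and the designated digital audio DB 314, and reusing a previously stored conversion is an assumption made here, not something the specification states. `get_designated_audio` and `fake_convert` are hypothetical names, and `fake_convert` merely tags the bytes rather than performing any real voice conversion.

```python
from typing import Callable

RawAudioDB = dict[str, bytes]                     # book_id -> raw audio bytes (DB 312)
DesignatedAudioDB = dict[tuple[str, str], bytes]  # (book_id, tone_id) -> audio (DB 314)


def get_designated_audio(book_id: str, tone_id: str,
                         raw_db: RawAudioDB, designated_db: DesignatedAudioDB,
                         convert: Callable[[bytes, str], bytes]) -> bytes:
    """Return audio for book_id in tone_id, converting and storing it if needed."""
    key = (book_id, tone_id)
    if key not in designated_db:
        # Convert the raw audio into the selected tone and keep the result.
        designated_db[key] = convert(raw_db[book_id], tone_id)
    return designated_db[key]


def fake_convert(raw: bytes, tone_id: str) -> bytes:
    """Placeholder for tone conversion: prefixes the tone id to the raw bytes."""
    return tone_id.encode("utf-8") + b":" + raw
```

With this arrangement a second request for the same book and tone is served from the stored copy without invoking the converter again.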

Because the human ear is highly sensitive, repetitive sounds easily cause fatigue and loss of concentration. According to the present invention, however, the tone that the user wants to hear in a given situation is selectively applied to produce customized designated digital audio data, so that even with constant use the digital audiobook always sounds fresh and the user does not easily become fatigued or lose concentration.

The translation module 304 translates into various languages the digital text data collected through the licensing module 301 and stored as text files in the digital text DB 310, or digital text data uploaded from the user terminals 100a, 100b, …, 100n, and stores and manages the result in the translated digital text DB 316.

At the request of the user terminal, the translated digital text data is converted into designated digital audio data through the TTS module 302 and the conversion module 303, and is stored and managed in the designated digital audio DB 314.

The analysis module 305 uses artificial intelligence to analyze a user's usage data collected through the user terminals 100a, 100b, …, 100n and recommends the digital audio data that the user needs or prefers.
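The specification does not say which analysis technique is used for recommendation. Purely as a stand-in, the sketch below ranks titles the user has not yet heard by how much listening time the user has spent in each title's genre. It is a simple frequency heuristic, not an artificial intelligence model, and the `recommend` function and its data shapes are hypothetical.

```python
from collections import Counter


def recommend(history: list[tuple[str, int]], catalog: dict[str, str],
              limit: int = 3) -> list[str]:
    """Recommend unheard titles, favouring genres the user listens to most.

    history: (book_id, seconds_listened) pairs; catalog: book_id -> genre.
    """
    seconds_by_genre: Counter = Counter()
    for book_id, seconds in history:
        seconds_by_genre[catalog[book_id]] += seconds
    heard = {book_id for book_id, _ in history}
    # Sort by listening time in the genre (descending), then by id for stable output.
    ranked = sorted(
        (book_id for book_id in catalog if book_id not in heard),
        key=lambda book_id: (-seconds_by_genre[catalog[book_id]], book_id),
    )
    return ranked[:limit]
```

A production system would presumably draw on richer usage signals than listening time alone.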

In addition, the analysis module 305 uses artificial intelligence to analyze the overall context of the collected digital text data, or of digital text data uploaded from the user terminals 100a, 100b, …, 100n, and generates raw (basic) digital audio data in the tone best suited to that context. For dialogue-style text in particular, generating the digital audio data in the tone that best matches the context brings the result closest to natural sound and natural tone.
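To illustrate the idea of assigning tones by context, the sketch below uses a deliberately crude rule: a paragraph containing a quoted span is treated as dialogue and given a speaker tone, and everything else is given a narrator tone. This rule is an assumption chosen for brevity and is far simpler than the context analysis the specification contemplates. `assign_tones` and the tone names are hypothetical.

```python
import re

# Matches a span enclosed in straight or curly double quotation marks.
DIALOGUE = re.compile(r'["“”].+?["“”]')


def assign_tones(paragraphs: list[str], narrator: str = "narrator",
                 speaker: str = "speaker") -> list[tuple[str, str]]:
    """Pair each paragraph with a tone id based on whether it contains dialogue."""
    return [
        (speaker if DIALOGUE.search(paragraph) else narrator, paragraph)
        for paragraph in paragraphs
    ]
```

Each (tone, paragraph) pair could then be passed to the speech synthesis step so that dialogue and narration are voiced differently.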

To implement each of these modules, the digital audiobook server 300 operates and manages the digital text DB 310, the raw digital audio DB 312, the designated digital audio DB 314, and the translated digital text DB 316.

Meanwhile, the digital audiobook server 300 may divide the users of the user terminals 100a, 100b, …, 100n into free general users, who do not pay a digital audiobook service fee, and paying customer users, who do, store and manage them in a user DB (not shown), and provide different (differentiated) digital audiobook services to general users and customer users.

For example, a daily or per-session data usage limit may be imposed on general users but not on customer users, and the translation service of the translation module 304 and the recommendation service of the analysis module 305 may be withheld from general users while being provided to customer users.
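The differentiated service described above could be expressed, as one possible sketch, with a small policy table. The 200 MB daily limit is an invented figure used only for illustration, since the specification gives no numbers, and the `Plan`, `PLANS`, and `may_download` names are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Plan:
    daily_limit_mb: Optional[int]  # None means no daily data limit
    translation: bool              # access to the translation service (module 304)
    recommendation: bool           # access to the recommendation service (module 305)


# Hypothetical policy table; the limit value is illustrative only.
PLANS = {
    "general": Plan(daily_limit_mb=200, translation=False, recommendation=False),
    "customer": Plan(daily_limit_mb=None, translation=True, recommendation=True),
}


def may_download(tier: str, used_today_mb: int, size_mb: int) -> bool:
    """Check whether a download of size_mb fits within the tier's daily limit."""
    limit = PLANS[tier].daily_limit_mb
    return limit is None or used_today_mb + size_mb <= limit
```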

FIG. 3 is a flowchart illustrating a digital audiobook production method according to an embodiment of the present invention.

A user uses a user terminal 100a, 100b, …, 100n to access the digital audiobook server 300 through the wired/wireless network 200 (S302) and log in, and then either selects from the digital text DB 310 the digital text data for which the digital audiobook service is desired, or uploads a book (digital text data) stored on the user terminal (S304).

When digital text data is selected from the digital text DB 310, the TTS module 302 converts the digital text data from text-file form into raw digital audio data in the form of a digital voice file and stores it in the raw digital audio DB 312 (S306).

For this purpose, artificial intelligence may also be used to analyze the overall context and designate the optimal tone that best matches it.

Next, when the user terminal 100a, 100b, …, 100n selects a desired tone from the tone server 500 (S308), the conversion module 303 converts the raw digital audio data into the selected tone (that is, into designated digital audio data), stores it in the designated digital audio DB 314, and provides it to the user terminal (S310).

The user terminal 100a, 100b, …, 100n plays the provided designated digital audio data to use the digital audiobook service.

When a book (digital text data) stored on the user terminal 100a, 100b, …, 100n is uploaded in step S304, the user may also select a desired language for the upload (S320).

In that case, the translation module 304 of the digital audiobook server 300 translates the uploaded digital text data into the selected language to generate translated digital text data (S322), and step S306 and the subsequent steps are then performed on the translated digital text data.
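The server-side portion of the flow of FIG. 3 can be summarized in the following minimal sketch. The translation, speech synthesis, and tone conversion stages are passed in as callables, and the stub implementations shown are placeholders that only tag the data. None of these function names comes from the specification.

```python
from typing import Callable, Optional


def produce_audiobook(text: str, tone_id: str,
                      tts: Callable[[str], bytes],
                      convert: Callable[[bytes, str], bytes],
                      translate: Optional[Callable[[str, str], str]] = None,
                      language: Optional[str] = None) -> tuple[bytes, list[str]]:
    """Run the optional translation, then TTS, then tone conversion.

    Returns the designated audio and the list of flowchart steps performed.
    """
    steps: list[str] = []
    if language is not None and translate is not None:
        text = translate(text, language)  # S322: translate uploaded text
        steps.append("S322")
    raw = tts(text)                       # S306: text -> raw digital audio data
    steps.append("S306")
    designated = convert(raw, tone_id)    # S310: raw -> designated digital audio data
    steps.append("S310")
    return designated, steps


# Placeholder stages (assumptions): they mark the data instead of processing it.
def stub_translate(text: str, language: str) -> str:
    return f"<{language}>{text}"


def stub_tts(text: str) -> bytes:
    return text.encode("utf-8")


def stub_convert(raw: bytes, tone_id: str) -> bytes:
    return f"[{tone_id}]".encode("utf-8") + raw
```

Passing the stages as parameters keeps the ordering of steps S322, S306, and S310 visible without committing to any particular translation or synthesis engine.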

FIG. 4 is a flowchart illustrating a method of indexing, extracting, and storing through a voice command while digital audio data is being played, according to an embodiment of the present invention.

During step S312 of FIG. 3, that is, while the user is receiving the digital audiobook service through the user terminal 100a, 100b, …, 100n by playing the designated digital audio data (S402), if there is an important passage that the user wants to save, the user searches (indexing) or extracts (copy) the needed part through a voice command (Search by Voice Command) of the user terminal (S404).

When the needed part is extracted (copy), the extracted digital audio/text data is stored (paste) in the user terminal 100a, 100b, …, 100n (S406), and the stored digital audio/text data is translated (S408).

When a search (indexing) is performed, the indexed digital audio/text data is played repeatedly or stored (paste) in the user terminal 100a, 100b, …, 100n (S410), and the stored digital audio/text data is translated (S412).
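As a hedged sketch of the copy and indexing branches of FIG. 4, the following handles a command that has already been converted to text. The speech recognition itself, the audio clips, and the translation steps S408 and S412 are outside this sketch, and the `Player` class and the command words `copy` and `search` are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Player:
    segments: list[str]                              # sentence texts, in playback order
    current: int = 0                                 # index of the sentence now playing
    saved: list[str] = field(default_factory=list)   # passages stored on the terminal

    def handle(self, command: str) -> list[int]:
        """Apply a recognized command and return the affected segment indexes."""
        verb, _, arg = command.strip().partition(" ")
        if verb == "copy":
            # Extract (copy) the current passage and store (paste) it (S404, S406).
            self.saved.append(self.segments[self.current])
            return [self.current]
        if verb == "search":
            # Index every segment containing the keyword for repeat playback (S404, S410).
            keyword = arg.lower()
            return [i for i, s in enumerate(self.segments) if keyword in s.lower()]
        raise ValueError(f"unsupported command: {command!r}")
```

For instance, a `search whales` command returns the indexes of all sentences mentioning whales, which the terminal could then replay in sequence.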

As described above, the present invention converts books of all kinds into digital audio data through the digital audiobook server to generate on-demand digital audiobooks, and lets the user download a designated-type digital audiobook to which a tone of the user's choice has been selectively applied. A user can also access the digital audiobook server from a user terminal, upload text data, and download it converted into customized-type digital audio data.

In addition, while playing and listening to the generated digital audio data, the user can, first, extract the needed part of the digital audio data as audio and text data through a voice command and store it, and second, search by keyword through a voice command and replay the matching data. With these techniques, a user who cannot view visual data, for example while driving or exercising, can make good use of audible data to acquire useful knowledge and information at any time.

The technical idea of the present invention has been described above together with the accompanying drawings, but this description presents preferred embodiments of the present invention by way of example and does not limit the present invention.

In addition, it is evident that anyone having ordinary skill in the art can make various modifications and imitations without departing from the scope of the technical idea of the present invention.

100a, 100b, ..., 100n: user terminal 200: wired/wireless network
300: digital audiobook server 301: licensing module
302: TTS module 303: conversion module
304: translation module 305: analysis module
400: publisher server 500: tone server

Claims

A digital audiobook production system comprising:
a digital audiobook server that signs a contract with a publisher server holding copyrights, collects the book data of the publisher, converts it into digital text data, converts the digital text data into raw digital audio data in a digital audio file format, converts the raw digital audio data into a tone selected by a user terminal, and provides it to the user terminal; and
a plurality of user terminals that access the digital audiobook server through a wired/wireless network, select raw digital audio data or a desired tone to download and execute designated digital audio data obtained by converting the raw digital audio data into the desired tone, and upload text data to download and execute raw digital audio data or designated digital audio data for the text data.

The digital audiobook production system according to claim 1,
wherein, when an important passage occurs while the digital audio data is being listened to, the user terminal stores the necessary part of the digital audio data as audio and text data by searching for or extracting it through a voice command.

The digital audiobook production system according to claim 2,
wherein the digital audiobook server translates the collected digital text data, or digital text data uploaded from the user terminal, into various languages, and converts the translated digital text data into raw digital audio data or designated digital audio data.

The digital audiobook production system according to claim 2,
wherein the digital audiobook server uses artificial intelligence to analyze the entire context of the collected digital text data, or of digital text data uploaded from the user terminal, and generates raw digital audio data with a tone suited to the context.

The digital audiobook production system according to claim 1,
wherein the digital audiobook server divides the users of the user terminals into free general users and paid customer users, manages them separately, and provides differentiated digital audiobook services to general users and customer users.

The digital audiobook production system according to claim 1,
wherein the digital audiobook server uses artificial intelligence to analyze the usage data generated by the user through the user terminal, and recommends to the user terminal the digital audio data that the user needs or prefers.

A digital audiobook production method comprising:
(A) accessing, from a user terminal, a digital audiobook server through a wired/wireless network, and selecting digital text data provided by the digital audiobook server or uploading digital text data stored on the user terminal;
(B) when the digital text data is selected, converting, in the digital audiobook server, the digital text data into raw digital audio data in a digital audio file format;
(C) when a desired tone is selected by the user terminal, converting, in the digital audiobook server, the raw digital audio data into the selected tone (that is, into designated digital audio data), and providing the designated digital audio data to the user terminal;
(D) using the digital audiobook service by executing, in the user terminal, the provided designated digital audio data; and
(E) when the stored digital text data is uploaded in step (A), selecting a desired language, translating, in the digital audiobook server, the uploaded digital text data into the selected language to generate translated digital text data, and then performing step (B) and the subsequent steps.

The digital audiobook production method according to claim 7,
wherein, in step (B), the digital audiobook server analyzes the entire context with artificial intelligence and designates a tone that matches the context.

The digital audiobook production method according to claim 7, further comprising:
(F) searching for (indexing) or extracting a necessary part through a voice command of the user terminal while the designated digital audio data of step (D) is being executed;
(G) when the necessary part is extracted (copy), storing (paste) the extracted digital audio/text data in the user terminal; and
(G') when the search (indexing) is performed, repeatedly executing the indexed digital audio/text data or storing the indexed digital audio/text data in the user terminal.