[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

WO2024169529A1 - Knowledge base construction method, data retrieval method and apparatus, and cloud device - Google Patents

Knowledge base construction method, data retrieval method and apparatus, and cloud device Download PDF

Info

Publication number
WO2024169529A1
WO2024169529A1 PCT/CN2024/073350 CN2024073350W WO2024169529A1 WO 2024169529 A1 WO2024169529 A1 WO 2024169529A1 CN 2024073350 W CN2024073350 W CN 2024073350W WO 2024169529 A1 WO2024169529 A1 WO 2024169529A1
Authority
WO
WIPO (PCT)
Prior art keywords
key
data
text
knowledge base
original data
Prior art date
Application number
PCT/CN2024/073350
Other languages
French (fr)
Chinese (zh)
Inventor
李鹤
Original Assignee
杭州阿里云飞天信息技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 杭州阿里云飞天信息技术有限公司 filed Critical 杭州阿里云飞天信息技术有限公司
Publication of WO2024169529A1 publication Critical patent/WO2024169529A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3329Natural language query formulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures
    • G06F16/316Indexing structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3344Query execution using natural language analysis

Definitions

  • the present application relates to the field of computer technology, and in particular to a knowledge base construction method, a data retrieval method, an apparatus and a cloud device.
  • the question-and-answer knowledge base is applied in intelligent customer service scenarios, in which the corresponding answer text is retrieved from the question-and-answer knowledge base based on the user's inquiry text and the user is replied.
  • a question-answering knowledge base is constructed by taking part of the text in the original text data as an index and taking the original text as the value of the index.
  • the question-answering knowledge base obtained by this construction method is not suitable for the retrieval of answer data of different modalities, which limits the application scope of the question-answering knowledge base in intelligent customer service scenarios.
  • a first aspect of an embodiment of the present application provides a method for constructing a knowledge base, comprising:
  • a second aspect of an embodiment of the present application provides a data retrieval method, comprising:
  • question-answering knowledge base answer data corresponding to the query data is retrieved, the modality of the answer data is one of text, image or video, and the question-answering knowledge base is constructed according to the knowledge base construction method of any one of the first aspects;
  • a third aspect of the present application embodiment provides a data retrieval method, which is applied to a terminal device.
  • the data retrieval method includes:
  • Receive answer data of the inquiry data sent by the server, the answer data is determined according to the data retrieval method of the second aspect.
  • a fourth aspect of the present application provides a knowledge base construction device, including:
  • An acquisition module used for acquiring multiple pieces of original data, where the modalities of the multiple pieces of original data are at least two of text, image or video;
  • a determination module for determining key text of the original data among the plurality of original data, determining a key of a key-value pair according to the key text, and determining a value of the key-value pair according to the original data, wherein the key text is used to describe the content of the original data;
  • the construction module is used to construct a question-answering knowledge base based on the key-value pairs corresponding to multiple pieces of original data.
  • the fifth aspect of an embodiment of the present application provides a cloud device, including: a processor, a memory, and a computer program stored in the memory and executable on the processor, wherein when the processor executes the computer program, the knowledge base construction method of the first aspect, the data retrieval method of the second aspect and/or the third aspect are implemented.
  • the sixth aspect of the embodiments of the present application provides a computer-readable storage medium, in which computer execution instructions are stored.
  • the computer execution instructions are executed by a processor, they are used to implement the knowledge base construction method of the first aspect, the data retrieval method of the second aspect and/or the third aspect.
  • a seventh aspect of an embodiment of the present application provides a computer program product, the program product comprising: a computer program, the computer program is stored in a readable storage medium, at least one processor of an electronic device can read the computer program from the readable storage medium, and at least one processor executes the computer program so that the electronic device executes the knowledge base construction method of the first aspect, the data retrieval method of the second aspect and/or the third aspect.
  • the embodiment of the present application is applied in an intelligent customer service scenario, by obtaining multiple pieces of original data, the modalities of the multiple pieces of original data are at least two of text, image or video; for the original data in the multiple pieces of original data, the key text of the original data is determined, and the key of the key-value pair is determined according to the key text, and the value of the key-value pair is determined according to the original data, and the key text is used to describe the content of the original data; according to the key-value pairs corresponding to the multiple pieces of original data, a question-value pairs corresponding to the multiple pieces of original data, a question-and-answer knowledge base is constructed, and a question-and-answer knowledge base that supports answer retrieval in different modalities can be constructed.
  • FIG1 is a diagram of an application scenario provided by an exemplary embodiment of the present application.
  • FIG2 is a flowchart of a method for constructing a knowledge base provided by an exemplary embodiment of the present application
  • FIG3 is a schematic diagram of a question-answering knowledge base provided by an exemplary embodiment of the present application.
  • FIG4 is a schematic diagram of another question-answering knowledge base provided by an exemplary embodiment of the present application.
  • FIG5 is a flowchart of a data retrieval method provided by an exemplary embodiment of the present application.
  • FIG6 is a flowchart of another data retrieval method provided by an exemplary embodiment of the present application.
  • FIG7 is a structural block diagram of a knowledge base construction device provided by an exemplary embodiment of the present application.
  • FIG8 is a schematic diagram of the structure of a cloud device provided by an exemplary embodiment of the present application.
  • Intelligent customer service is an automatic question-and-answer product.
  • the index (key in the key-value pair) construction method is to directly build a query index for a certain field.
  • this method is not suitable because the multi-source knowledge of intelligent customer service is usually in different modalities, and each modality includes different types of data. As the business develops, the amount of raw data will also increase. Therefore, how to design a question-and-answer knowledge base that can support the retrieval of answer data of different modalities and different types, and at the same time, the method of building the knowledge base is scalable and can be applied to new raw data is particularly important.
  • the knowledge base construction method includes: obtaining multiple pieces of original data, the modalities of the multiple pieces of original data are at least two of text, image or video; for the original data in the multiple pieces of original data, determining the key text of the original data, and determining the key of the key-value pair according to the key text, and determining the value of the key-value pair according to the original data, the key text is used to describe the content of the original data; constructing a question-and-answer knowledge base according to the key-value pairs corresponding to the multiple pieces of original data.
  • the present application can construct a question-and-answer knowledge base that supports answer retrieval in different modalities, and the construction method is extensible. When there is new original data, this method can be used to obtain key-value pairs for the new original data and incorporate them into the question-and-answer knowledge base.
  • the overall knowledge base construction method can be realized by means of a cloud computing system.
  • the server of the knowledge base construction method can be a cloud server, so as to run various neural network models with the advantage of cloud resources; relative to the cloud, the knowledge base construction method can also be applied to conventional servers or server arrays and other server-side devices, which are not limited here.
  • an application scenario diagram provided by the present application includes raw data of different modalities, such as text, images, or videos.
  • the text data comes from different types of text, such as conversations, comments, and graphs. These raw data are processed to build a question-and-answer knowledge base. After the question-and-answer knowledge base is put online for use, upon receiving the user's inquiry text, the corresponding answer data can be found in the question-and-answer knowledge base, and the answer data can be returned to the user to reply to the user's inquiry text.
  • FIG2 is a flowchart of a method for constructing a knowledge base provided by an exemplary embodiment of the present application.
  • the method for constructing the knowledge base used in the server, as shown in FIG2, specifically includes the following steps:
  • the modalities of the multiple pieces of original data are at least two of text, image or video.
  • the database includes multiple pieces of raw data, which come from different data sources, and the data of the data source is text mode, image mode or video mode.
  • the data source of the text mode includes: different types of data sources, such as user comment data source, person-to-person dialogue data source, product attribute data source, product knowledge graph data source, product details page data source and product homepage data source.
  • the user comment data source includes: user comment text on the quality of a certain product, the express delivery speed of the product or the service of the merchant.
  • the person-to-person dialogue data source includes: the dialogue text between the user and the merchant's manual customer service;
  • the product attribute data source includes: text descriptions of product attributes (such as category, size, color, etc.).
  • the product knowledge graph data source includes: nodes represent products, and the connections between nodes represent the knowledge graph of the connections between products.
  • the product details page data source contains text content on the image, which is used to describe the product. These text introduction contents can be identified by OCR (optical character recognition) technology and used as raw data.
  • OCR optical character recognition
  • the data source of the product homepage is the text content contained in the image, which is also used to describe the product. The text content can also be recognized by OCR technology and used as original data.
  • the image may include an image of a product object, or may include an image of text, such as the product details page, product homepage, etc.
  • the video may include a product introduction video, a product usage video, or a product installation video.
  • a piece of raw data such as a comment text is a piece of raw data
  • a person-to-person conversation is a piece of raw data
  • a product attribute is a piece of raw data
  • a knowledge graph is a piece of raw data
  • all the text recognized in an image is a piece of raw data
  • an image is a piece of raw data
  • a video is a piece of raw data.
  • the source of the original data is not limited.
  • At least one key text can be determined corresponding to a piece of raw data.
  • the key text is used to describe the content of the raw data. Determining the key text of the raw data includes: converting the raw data into natural language text; extracting the key text from the natural language text to obtain the key text, and the key text includes: at least one of the content theme, core viewpoint, keyword, and key entity of the raw data.
  • converting the original data into natural language text includes: processing the original data through a prompt (a natural language model) template to obtain natural language text.
  • the prompt template is pre-trained and can convert the original data into natural language text. If the original data is an image or video (including multiple frames of images), the image or video can be input into a pre-trained multimodal recognition model for processing, and a natural language text describing the corresponding image or video can be obtained.
  • key text is extracted from the natural language text to obtain the key text. It includes: inputting natural language text into a unified model to extract key text and obtain key text.
  • the unified model includes multiple pre-trained sub-models, which can realize the unification of multiple NLP (natural language) tasks, such as matching sub-models, classification sub-models or sorting sub-models.
  • the matching sub-model is used to realize the matching task
  • the classification sub-model is used to realize the classification task
  • the sorting sub-model is used to realize the sorting task.
  • the natural language text is input into the unified model, and at least one key text can be obtained through the processing of each sub-model.
  • the unified model can extract the core contents of the original data, such as content themes, core ideas, keywords, key entities, etc.
  • content themes such as whether the original data is about goods or quality
  • core ideas such as whether the original data is positive data or negative data
  • keywords such as keywords in the original data that describe the content theme attributes
  • key entities such as location, time, product name or product category, etc.
  • the key text can be used as the key of the key-value pair
  • the natural language text or original data can be used as the value of the key-value pair to obtain the key-value pair, thereby constructing a question-answering knowledge base.
  • one piece of original data corresponds to one piece of natural language text
  • one piece of natural language text can correspond to one key, which is one key text in the key text of the original data or a combination of multiple key texts.
  • the original data is X
  • the natural text corresponding to the original data is N
  • the key text corresponding to the original data includes topic A, core view B, keyword C, keyword D, key entity E and key entity F.
  • the key-value pairs included in the constructed question-answering knowledge base are shown in FIG3 .
  • the value of the key-value pair is determined based on the original data, including: segmenting the original data based on the key text to obtain data segments corresponding to the key text, the mode of the data segments is the same as the mode of the original data; and determining the value of the key-value pair based on the data segments.
  • the original data is text
  • the original data is segmented based on the key text to obtain data segments corresponding to the key text, including: using machine reading comprehension technology (MRC, Machine Reading Comprehension) to extract data segments describing the key text from the original data.
  • MRC Machine Reading Comprehension
  • the key text and the corresponding raw data are input into the understanding model based on machine reading comprehension technology, and the data fragments describing the key text are output.
  • the understanding model can remove the redundant noise data in the raw data, thereby improving the overall quality of the data fragments.
  • a key text has a corresponding data fragment in the raw data.
  • the original data is "the parameters of this projector are body weight 1kg, zoom multiple is fixed focus, and light source type is LED light source”
  • the key texts obtained are: projector, weight, zoom multiple, fixed focus, light source type, LED light source.
  • the data segment corresponding to "projector” can be the entire original data
  • the data segment corresponding to weight can be "body weight 1kg”
  • the data segment corresponding to zoom multiple and/or fixed focus is zoom multiple is fixed focus
  • the data segment corresponding to light source type and/or LED light source is light source type is LED light source.
  • the entire image can be used as the value of the key-value pair, or the original data can be segmented, and the local image obtained by segmentation is the data fragment as the value of the key-value pair.
  • the segmentation method can be through image recognition or image processing technology.
  • the object contained in an image is a set of tableware. If the key text is “spoon”, the partial image containing the spoon in the image can be used as the data segment of the “spoon”.
  • the entire video can be used as the value of the key-value pair, or a portion of the frame image in the video can be used as the value of the key-value pair, or a local image in the image in the video can be used as the value of the key-value pair.
  • the specific segmentation method of the image and video is not limited.
  • determining the key of the key-value pair according to the key text includes: expanding the key text to obtain a first inquiry text; and determining the first inquiry text as the key of the key-value pair.
  • one or more key texts can be expanded to obtain a first inquiry text.
  • the corresponding key texts include topic A, core view B, keyword C, keyword D, key entity E, and key entity F.
  • topic A can be expanded to obtain the corresponding first inquiry text
  • topic A and core view B can be expanded to obtain the corresponding first inquiry text.
  • one original data can correspond to multiple first inquiry texts.
  • the first inquiry text is determined as the key of the key-value pair, which is the index.
  • the key text may be expanded by inputting the key text into a pre-trained expansion model, so that the key text is expanded into a natural language inquiry text.
  • the expanded first inquiry text may be "How much does the projector weigh?" It can be understood that the expanded first inquiry text includes the key text.
  • the method further includes: encoding the first query text to obtain a coding vector; and determining the coding vector as a key in the key-value pair.
  • the first inquiry text is encoded using an encoder of a pre-trained BERT language model, the first inquiry text is input into the encoder for encoding, and the output is the encoding vector of the first inquiry text.
  • it also includes: performing data mining on the original data to obtain target data; generating a second query text of the target data based on the target data; determining the target data as the value of the key-value pair and determining the second query text as the key of the key-value pair to construct a question-and-answer knowledge base.
  • the data mined from the original data can be combined into key-value pairs and added to the question-answer knowledge base constructed as described above, thereby increasing the amount of data in the question-answer knowledge base.
  • user comment data is mined, and the target data obtained is comment data, and multiple comment data constitute the comment knowledge base.
  • Product attribute data is mined, and the target data obtained is product data, and multiple product data constitute the product knowledge base.
  • Other data is mined, and the target data obtained is general data, and multiple general data constitute the general knowledge base.
  • the key of the target data is determined, and the target data is used as the value, so that the key-value pair can be obtained and added to the question-answer knowledge base constructed in the above manner.
  • the key of the question-answering knowledge base may be a text or a coded vector, but the value corresponding to the key may be a text, an image or a video.
  • the encoding vector b is the encoding vector of the query text a
  • the corresponding value of the two is the text g.
  • the encoding vector d is the encoding vector of the query text c
  • the corresponding value of the two is the image h.
  • the encoding vector f is the encoding vector of the query text e
  • the corresponding value of the two is the video k.
  • the method for constructing a question-and-answer knowledge base can solve the problem of unified retrieval of answer data of different modalities. At the same time, it has strong scalability and can better solve the problem that the method for constructing a question-and-answer knowledge base in related technologies is not flexible enough.
  • This application can quickly construct a question-and-answer knowledge base of 100 million levels, thereby improving the retrieval coverage of online inquiry data.
  • FIG5 is a flowchart of a data retrieval method provided by an exemplary embodiment of the present application, which is applied to a server. Specifically, the following steps are included:
  • the query data is one of text, image or video. If it is an image or video, the image or video can be converted into a query text, and the query text can describe the content of the image or video.
  • the mode of the answer data is one of text, image or video
  • the question-answer knowledge base is constructed according to the above-mentioned knowledge base construction method.
  • answer data corresponding to the query data is retrieved, including: determining that the value in a key-value pair of a key determined based on the query data is the answer data; and/or encoding the query data to obtain a query encoding vector; in the question-and-answer knowledge base, determining a target encoding vector whose similarity with the encoding vector is greater than a threshold; in the question-and-answer knowledge base, determining that the value of the key-value pair with the target encoding vector as the key is the answer data.
  • determining that the value in the key-value pair of the key determined based on the query data is the answer data includes: if the query data sent by the terminal device is text, the corresponding value can be retrieved as the answer data in the question and answer knowledge base using the query data as the key; if the corresponding value cannot be retrieved, the query data is encoded to obtain a coding vector, and the corresponding value is retrieved as the answer data in the question and answer knowledge base using the coding vector as the key.
  • the query data is an image or video
  • the image or video can be extracted and processed to obtain the corresponding query text, and then the query text is used as a key or the encoding vector of the query text is used as a key to retrieve the corresponding value as the answer data.
  • the server can retrieve answer data for the query data based on the knowledge base to provide high-quality answers to the user.
  • FIG6 is a flowchart of another data retrieval method provided by an exemplary embodiment of the present application. Applied to a terminal device, the method specifically includes the following steps:
  • the query data is one of text, image or video.
  • the answer data is determined according to the above-mentioned data retrieval method.
  • a knowledge base construction device 70 in addition to providing a method for constructing a knowledge base, a knowledge base construction device 70 is also provided.
  • the knowledge base construction device 70 includes:
  • An acquisition module 71 is used to acquire multiple pieces of original data, where the modalities of the multiple pieces of original data are at least two of text, image or video;
  • a determination module 72 for determining key text of the original data among the plurality of original data, and determining a key of a key-value pair according to the key text, and determining a value of the key-value pair according to the original data, wherein the key text is used to describe the content of the original data;
  • the construction module 73 is used to construct a question-answer knowledge base according to the key-value pairs corresponding to the multiple original data.
  • the determination module 72 is specifically used to: convert the original data into natural language text; extract key text from the natural language text to obtain key text, and the key text includes: at least one of the content theme, core ideas, keywords, and key entities of the original data.
  • the determination module 72 when extracting key text from natural language text to obtain key text, is specifically used to: input the natural language text into a unified model to extract key text to obtain key text.
  • the determination module 72 is specifically used to: segment the original data based on the key text to obtain data segments corresponding to the key text, and the mode of the data segments is the same as the mode of the original data; determine the value of the key-value pair based on the data segments.
  • the original data is text
  • the determination module 72 segments the original data based on the key text to obtain data segments corresponding to the key text, specifically for: using machine reading comprehension technology to extract data segments describing the key text from the original data.
  • the determination module 72 is specifically used to: expand the key text to obtain a first query text; and determine that the first query text is a key of the key-value pair.
  • the determination module 72 is further used to encode the first query text to obtain an encoding vector; and determine the encoding vector as a key in the key-value pair.
  • the determination module 72 is also used to perform data mining on the original data to obtain target data; generate a second query text for the target data based on the target data; determine the target data as the value of the key-value pair and determine the second query text as the key of the key-value pair to construct a question-and-answer knowledge base.
  • a data retrieval device (not shown) is also provided, and the data retrieval device includes:
  • a receiving module used to receive inquiry data sent by a terminal device, where the inquiry data is one of text, image or video;
  • a retrieval module is used to retrieve answer data corresponding to the query data in the question-answer knowledge base, where the mode of the answer data is one of text, image or video.
  • the question-answer knowledge base is constructed according to the above-mentioned knowledge base construction method;
  • the sending module is used to send answer data to the terminal device.
  • the retrieval module is specifically used to: determine that a value in a key-value pair of a key determined based on the query data is answer data;
  • encode the query data to obtain a query encoding vector; in the question-answering knowledge base, determine a target encoding vector whose similarity with the encoding vector is greater than a threshold; in the question-answering knowledge base, determine that the value of the key-value pair with the target encoding vector as the key is the answer data.
  • the data retrieval device includes:
  • a sending module used for sending query data to the server
  • the receiving module is used to receive answer data of the inquiry data sent by the server, and the answer data is determined by the above-mentioned data retrieval method.
  • a knowledge base construction device which is applied to an intelligent customer service scenario, by obtaining multiple pieces of original data, the modalities of the multiple pieces of original data are at least two of text, image or video; for the original data in the multiple pieces of original data, the key text of the original data is determined, and the key of the key-value pair is determined according to the key text, and the value of the key-value pair is determined according to the original data, and the key text is used to describe the content of the original data; according to the key-value pairs corresponding to the multiple pieces of original data, a question-value pairs corresponding to the multiple pieces of original data, a question-and-answer knowledge base is constructed, and a question-and-answer knowledge base that supports answer retrieval in different modalities can be constructed.
  • FIG8 is a schematic diagram of the structure of a cloud device 80 provided by an exemplary embodiment of the present application.
  • the cloud device 80 is used to run the above-mentioned knowledge base construction method or image processing method.
  • the cloud device includes: a memory 84 and a processor 85 .
  • the memory 84 is used to store computer programs and can be configured to store various other information to support operations on the cloud device.
  • the memory 84 can be an object storage service (OSS).
  • OSS object storage service
  • the memory 84 can be implemented by any type of volatile or nonvolatile storage device or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic disk or optical disk.
  • SRAM static random access memory
  • EEPROM electrically erasable programmable read-only memory
  • EPROM erasable programmable read-only memory
  • PROM programmable read-only memory
  • ROM read-only memory
  • magnetic memory magnetic memory
  • flash memory magnetic disk or optical disk.
  • the processor 85 is coupled to the memory 84 and is used to execute the computer program in the memory 84 to: obtain multiple pieces of original data, where the modalities of the multiple pieces of original data are at least two of text, image, or video; determine key text of the original data for the original data in the multiple pieces of original data, determine the key of the key-value pair according to the key text, and determine the value of the key-value pair according to the original data, where the key text is used to describe the content of the original data; and determine the value of the key-value pair according to the multiple pieces of original data. According to the corresponding key-value pairs, a question-answer knowledge base is constructed.
  • the processor 85 is specifically used to: convert the original data into natural language text; extract the key text from the natural language text to obtain the key text, and the key text includes: at least one of the content theme, core ideas, keywords, and key entities of the original data.
  • the processor 85 extracts key text from the natural language text to obtain the key text, it is specifically used to: input the natural language text into a unified model to extract the key text to obtain the key text.
  • the processor 85 when determining the value of the key-value pair based on the original data, is specifically used to: segment the original data based on the key text to obtain data segments corresponding to the key text, the mode of the data segments is the same as the mode of the original data; and determine the value of the key-value pair based on the data segments.
  • the original data is text
  • the processor 85 segments the original data based on the key text to obtain data segments corresponding to the key text, it is specifically used to: use machine reading comprehension technology to extract data segments describing the key text from the original data.
  • the processor 85 is further used to: encode the first query text to obtain a coding vector; and determine the coding vector as a key in the key-value pair.
  • the processor 85 is also used to perform data mining on the original data to obtain target data; generate a second query text of the target data based on the target data; determine the target data as the value of the key-value pair and determine the second query text as the key of the key-value pair to construct a question and answer knowledge base.
  • the processor 85 is coupled to the memory 84, and is used to execute the computer program in the memory 84, so as to: receive query data sent by a terminal device, the query data being one of text, image or video; retrieve answer data corresponding to the query data in a question and answer knowledge base, the modality of the answer data being one of text, image or video, and the question and answer knowledge base is constructed according to any of the above-mentioned knowledge base construction methods; and send the answer data to the terminal device.
  • the processor 85 retrieves answer data corresponding to the query data in the question and answer knowledge base, it is specifically used to: determine that the value in the key-value pair of the key determined based on the query data is the answer data; and/or, encode the query data to obtain a query encoding vector; determine in the question and answer knowledge base a target encoding vector whose similarity with the encoding vector is greater than a threshold.
  • the processor 85 is coupled to the memory 84 and is used to execute the computer program in the memory 84 to: send query data to the server; receive answer data of the query data sent by the server, and the answer data is determined according to the above-mentioned data retrieval method.
  • the cloud device also includes other components such as a firewall 81, a load balancer 82, a communication component 86, and a power supply component 83.
  • Fig. 8 only schematically shows some components, which does not mean that the cloud device only includes the components shown in Fig. 8 .
  • the cloud device provided in the embodiment of the present application can obtain a compressed visual network model, which occupies a smaller memory and has a faster computing efficiency without affecting the recognition accuracy.
  • the embodiment of the present application also provides a computer-readable storage medium storing a computer program.
  • the processor is caused to implement the steps in the above-mentioned method.
  • an embodiment of the present application also provides a computer program product, including a computer program/instruction.
  • the processor is caused to implement the steps in the method shown above.
  • the communication component of Figure 8 above is configured to facilitate wired or wireless communication between the device where the communication component is located and other devices.
  • the device where the communication component is located can access a wireless network based on a communication standard, such as WiFi, 2G, 3G, 4G/LTE, 5G and other mobile communication networks, or a combination thereof.
  • the communication component receives a broadcast signal or broadcast-related text from an external broadcast management system via a broadcast channel.
  • the communication component also includes a near field communication (NFC) module to facilitate short-range communication.
  • the NFC module can be implemented based on radio frequency identification (RFID) technology, infrared information association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology and other technologies.
  • RFID radio frequency identification
  • IrDA infrared information association
  • UWB ultra-wideband
  • Bluetooth Bluetooth
  • the power supply assembly of Figure 8 provides power to various components of the device in which the power supply assembly is located.
  • the power supply assembly may include a power management system, one or more power supplies, and other components associated with generating, managing, and distributing power to the device in which the power supply assembly is located.
  • the units described as separate components may or may not be physically separated, and the components shown as units may or may not be physical units, that is, they may be located in one place or distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
  • each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit.
  • the above-mentioned integrated unit may be implemented in the form of hardware or in the form of hardware plus software functional units.
  • the above-mentioned integrated unit implemented in the form of a software functional unit can be stored in a computer-readable storage medium.
  • the above-mentioned software functional unit is stored in a storage medium, including a number of instructions for a computer device (which can be a personal computer, a server, or a network device, etc.) or a processor (processor) to execute some steps of the methods of each embodiment of the present application.
  • the aforementioned storage medium includes: U disk, mobile hard disk, read-only memory (ROM), random access memory (RAM), disk or optical disk and other media that can store program codes.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Computational Linguistics (AREA)
  • Artificial Intelligence (AREA)
  • Human Computer Interaction (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The present application provides a knowledge base construction method, a data retrieval method and apparatus, and a cloud device. The knowledge base construction method comprises: obtaining a plurality of pieces of original data, the modalities of the plurality of pieces of original data being at least two of: text, image or video; for original data in the plurality of pieces of original data, determining a key text of the original data, determining the key of a key-value pair according to the key text, and determining the value of the key-value pair according to the original data, wherein the key text is used for describing the content of the original data; according to the key-value pairs corresponding to the plurality of pieces of original data, constructing a question and answer knowledge base, thereby allowing for constructing a question and answer knowledge base supporting different modalities of answer retrieval.

Description

知识库的构建方法、数据检索方法、装置和云设备Knowledge base construction method, data retrieval method, device and cloud device
本申请要求于2023年02月13日提交中国专利局、申请号为202310166877.7、申请名称为“知识库的构建方法、数据检索方法、装置和云设备”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application filed with the China Patent Office on February 13, 2023, with application number 202310166877.7 and application name “Knowledge base construction method, data retrieval method, device and cloud device”, all contents of which are incorporated by reference in this application.
技术领域Technical Field
本申请涉及计算机技术领域,尤其涉及一种知识库的构建方法、数据检索方法、装置和云设备。The present application relates to the field of computer technology, and in particular to a knowledge base construction method, a data retrieval method, an apparatus and a cloud device.
背景技术Background Art
问答知识库应用于智能客服场景中,其中,根据用户的问询文本在该问答知识库中检索对应的答案文本回复用户。The question-and-answer knowledge base is applied in intelligent customer service scenarios, in which the corresponding answer text is retrieved from the question-and-answer knowledge base based on the user's inquiry text and the user is replied.
相关技术中,通过将原始文本数据中的部分文本作为索引,将该原始文本作为该索引的值,构建问答知识库。但是该构建方法得到的问答知识库不适用于不同模态的答案数据的检索,限制了问答知识库在智能客服场景的应用范围。In the related art, a question-answering knowledge base is constructed by taking part of the text in the original text data as an index and taking the original text as the value of the index. However, the question-answering knowledge base obtained by this construction method is not suitable for the retrieval of answer data of different modalities, which limits the application scope of the question-answering knowledge base in intelligent customer service scenarios.
发明内容Summary of the invention
本申请的多个方面提供一种知识库的构建方法、数据检索方法、装置和云设备,以解决相关技术构建的问答知识库不适用于不同模态的答案数据的检索。Multiple aspects of the present application provide a knowledge base construction method, data retrieval method, apparatus and cloud device to solve the problem that the question-and-answer knowledge base constructed by related technologies is not suitable for retrieval of answer data of different modalities.
本申请实施例第一方面提供一种知识库的构建方法,包括:A first aspect of an embodiment of the present application provides a method for constructing a knowledge base, comprising:
获取多条原始数据,多条原始数据的模态为文本、图像或视频中的至少两种;Acquire multiple pieces of original data, where the modalities of the multiple pieces of original data are at least two of text, image, or video;
针对多条原始数据中的原始数据,确定原始数据的关键文本,并根据关键文本确定键值对的键,根据原始数据确定键值对的值,关键文本用于描述原始数据的内容;For original data in the plurality of original data, determine key text of the original data, determine a key of a key-value pair according to the key text, and determine a value of the key-value pair according to the original data, wherein the key text is used to describe the content of the original data;
根据多条原始数据对应的键值对,构建问答知识库。Build a question-and-answer knowledge base based on the key-value pairs corresponding to multiple pieces of original data.
本申请实施例第二方面提供一种数据检索方法,包括:A second aspect of an embodiment of the present application provides a data retrieval method, comprising:
接收终端设备发送的问询数据,问询数据为文本、图像或视频中的一个;Receiving inquiry data sent by a terminal device, where the inquiry data is one of text, image or video;
在问答知识库中,检索问询数据对应的答案数据,答案数据的模态为文本、图像或者视频中的一个,问答知识库是根据第一方面任一项的知识库的构建方法构建的;In the question-answering knowledge base, answer data corresponding to the query data is retrieved, the modality of the answer data is one of text, image or video, and the question-answering knowledge base is constructed according to the knowledge base construction method of any one of the first aspects;
向终端设备发送答案数据。Send answer data to the terminal device.
本申请实施例第三方面提供一种数据检索方法,应用于终端设备,数据检索方法,包括: A third aspect of the present application embodiment provides a data retrieval method, which is applied to a terminal device. The data retrieval method includes:
向服务器发送问询数据;Send query data to the server;
接收服务器发送的问询数据的答案数据,答案数据是根据第二方面的数据检索方法确定。Receive answer data of the inquiry data sent by the server, the answer data is determined according to the data retrieval method of the second aspect.
本申请实施例第四方面提供一种知识库的构建装置,包括:A fourth aspect of the present application provides a knowledge base construction device, including:
获取模块,用于获取多条原始数据,多条原始数据的模态为文本、图像或视频中的至少两种;An acquisition module, used for acquiring multiple pieces of original data, where the modalities of the multiple pieces of original data are at least two of text, image or video;
确定模块,用于针对多条原始数据中的原始数据,确定原始数据的关键文本,并根据关键文本确定键值对的键,根据原始数据确定键值对的值,关键文本用于描述原始数据的内容;a determination module, for determining key text of the original data among the plurality of original data, determining a key of a key-value pair according to the key text, and determining a value of the key-value pair according to the original data, wherein the key text is used to describe the content of the original data;
构建模块,用于根据多条原始数据对应的键值对,构建问答知识库。The construction module is used to construct a question-answering knowledge base based on the key-value pairs corresponding to multiple pieces of original data.
本申请实施例第五方面提供一种云设备,包括:处理器、存储器及存储在存储器上并可在处理器上运行的计算机程序,处理器执行计算机程序时实现第一方面的知识库的构建方法、第二方面和/或第三方面的数据检索方法。The fifth aspect of an embodiment of the present application provides a cloud device, including: a processor, a memory, and a computer program stored in the memory and executable on the processor, wherein when the processor executes the computer program, the knowledge base construction method of the first aspect, the data retrieval method of the second aspect and/or the third aspect are implemented.
本申请实施例第六方面提供一种计算机可读存储介质,计算机可读存储介质中存储有计算机执行指令,计算机执行指令被处理器执行时用于实现第一方面的知识库的构建方法、第二方面和/或第三方面的数据检索方法。The sixth aspect of the embodiments of the present application provides a computer-readable storage medium, in which computer execution instructions are stored. When the computer execution instructions are executed by a processor, they are used to implement the knowledge base construction method of the first aspect, the data retrieval method of the second aspect and/or the third aspect.
本申请实施例第七方面提供一种计算机程序产品,程序产品包括:计算机程序,计算机程序存储在可读存储介质中,电子设备的至少一个处理器可以从可读存储介质读取计算机程序,至少一个处理器执行计算机程序使得电子设备执行第一方面的知识库的构建方法、第二方面和/或第三方面的数据检索方法。A seventh aspect of an embodiment of the present application provides a computer program product, the program product comprising: a computer program, the computer program is stored in a readable storage medium, at least one processor of an electronic device can read the computer program from the readable storage medium, and at least one processor executes the computer program so that the electronic device executes the knowledge base construction method of the first aspect, the data retrieval method of the second aspect and/or the third aspect.
本申请实施例应用于智能客服场景中,通过获取多条原始数据,多条原始数据的模态为文本、图像或视频中的至少两种;针对多条原始数据中的原始数据,确定原始数据的关键文本,并根据关键文本确定键值对的键,根据原始数据确定键值对的值,关键文本用于描述原始数据的内容;根据多条原始数据对应的键值对,构建问答知识库,能够构建一个支持不同模态的答案检索的问答知识库。The embodiment of the present application is applied in an intelligent customer service scenario, by obtaining multiple pieces of original data, the modalities of the multiple pieces of original data are at least two of text, image or video; for the original data in the multiple pieces of original data, the key text of the original data is determined, and the key of the key-value pair is determined according to the key text, and the value of the key-value pair is determined according to the original data, and the key text is used to describe the content of the original data; according to the key-value pairs corresponding to the multiple pieces of original data, a question-and-answer knowledge base is constructed, and a question-and-answer knowledge base that supports answer retrieval in different modalities can be constructed.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
此处所说明的附图用来提供对本申请的进一步理解,构成本申请的一部分,本申请的示意性实施例及其说明用于解释本申请,并不构成对本申请的不当限定。在附图中:The drawings described herein are used to provide a further understanding of the present application and constitute a part of the present application. The illustrative embodiments of the present application and their descriptions are used to explain the present application and do not constitute an improper limitation on the present application. In the drawings:
图1为本申请示例性实施例提供的一种应用场景图;FIG1 is a diagram of an application scenario provided by an exemplary embodiment of the present application;
图2为本申请示例性实施例提供的一种知识库的构建方法的步骤流程图;FIG2 is a flowchart of a method for constructing a knowledge base provided by an exemplary embodiment of the present application;
图3为本申请示例性实施例提供的一种问答知识库的示意图;FIG3 is a schematic diagram of a question-answering knowledge base provided by an exemplary embodiment of the present application;
图4为本申请示例性实施例提供的另一种问答知识库的示意图;FIG4 is a schematic diagram of another question-answering knowledge base provided by an exemplary embodiment of the present application;
图5为本申请示例性实施例提供的一种数据检索方法的步骤流程图; FIG5 is a flowchart of a data retrieval method provided by an exemplary embodiment of the present application;
图6为本申请示例性实施例提供的另一种数据检索方法的步骤流程图;FIG6 is a flowchart of another data retrieval method provided by an exemplary embodiment of the present application;
图7为本申请示例性实施例提供的一种知识库的构建装置的结构框图;FIG7 is a structural block diagram of a knowledge base construction device provided by an exemplary embodiment of the present application;
图8为本申请示例性实施例提供的一种云设备的结构示意图。FIG8 is a schematic diagram of the structure of a cloud device provided by an exemplary embodiment of the present application.
具体实施方式DETAILED DESCRIPTION
为使本申请的目的、技术方案和优点更加清楚,下面将结合本申请具体实施例及相应的附图对本申请技术方案进行清楚、完整地描述。显然,所描述的实施例仅是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。In order to make the purpose, technical solution and advantages of the present application clearer, the technical solution of the present application will be clearly and completely described below in combination with the specific embodiments of the present application and the corresponding drawings. Obviously, the described embodiments are only part of the embodiments of the present application, not all of the embodiments. Based on the embodiments in the present application, all other embodiments obtained by ordinary technicians in this field without making creative work are within the scope of protection of the present application.
智能客服是一款自动问答产品,在电商领域,面对商家丰富的商品知识和用户可能会问到的海量问题,若要给用户提供高质量的答案,需要问答知识库中有足够的数据。相关技术中,索引(键值对中的键)构建方法是直接构建某个字段的查询索引,但是这种方法在智能客服多源知识场景中,由于智能客服的多源知识通常是不同模态的,每个模态又包括不同类型的数据,随着业务的发展,原始数据的数量也会增加。因此,如何设计一个可以支持不同模态不同类型的答案数据检索的问答知识库,同时构建知识库的方法是可扩展的,能够应用在新的原始数据尤为重要。Intelligent customer service is an automatic question-and-answer product. In the field of e-commerce, facing the merchants’ rich product knowledge and the massive questions that users may ask, if you want to provide users with high-quality answers, you need to have enough data in the question-and-answer knowledge base. In related technologies, the index (key in the key-value pair) construction method is to directly build a query index for a certain field. However, in the multi-source knowledge scenario of intelligent customer service, this method is not suitable because the multi-source knowledge of intelligent customer service is usually in different modalities, and each modality includes different types of data. As the business develops, the amount of raw data will also increase. Therefore, how to design a question-and-answer knowledge base that can support the retrieval of answer data of different modalities and different types, and at the same time, the method of building the knowledge base is scalable and can be applied to new raw data is particularly important.
基于上述背景,本申请实施例提供的知识库的构建方法,包括:获取多条原始数据,多条原始数据的模态为文本、图像或视频中的至少两种;针对多条原始数据中的原始数据,确定原始数据的关键文本,并根据关键文本确定键值对的键,根据原始数据确定键值对的值,关键文本用于描述原始数据的内容;根据多条原始数据对应的键值对,构建问答知识库,本申请能够构建一个支持不同模态的答案检索的问答知识库,并且该构建方法是可扩展的,在有新的原始数据时,可以采用该方法对新的原始数据得到键值对,并入问答知识库中。Based on the above background, the knowledge base construction method provided in the embodiment of the present application includes: obtaining multiple pieces of original data, the modalities of the multiple pieces of original data are at least two of text, image or video; for the original data in the multiple pieces of original data, determining the key text of the original data, and determining the key of the key-value pair according to the key text, and determining the value of the key-value pair according to the original data, the key text is used to describe the content of the original data; constructing a question-and-answer knowledge base according to the key-value pairs corresponding to the multiple pieces of original data. The present application can construct a question-and-answer knowledge base that supports answer retrieval in different modalities, and the construction method is extensible. When there is new original data, this method can be used to obtain key-value pairs for the new original data and incorporate them into the question-and-answer knowledge base.
在本实施例中,可以是借助云计算系统实现整体的知识库的构建方法。此外,知识库的构建方法的服务器可以为云服务器,以便借助于云上资源的优势运行各种神经网络模型;相对于云端,知识库的构建方法也可以应用于常规服务器或服务器阵列等服务端设备,在此不加以限定。In this embodiment, the overall knowledge base construction method can be realized by means of a cloud computing system. In addition, the server of the knowledge base construction method can be a cloud server, so as to run various neural network models with the advantage of cloud resources; relative to the cloud, the knowledge base construction method can also be applied to conventional servers or server arrays and other server-side devices, which are not limited here.
参照图1,为本申请提供的一种应用场景图,其中,包括不同模态的原始数据,如文本、图像或视频。文本数据来源于不同类型的文本,如对话、评论以及图谱等。将这些原始数据进行处理以构建问答知识库,将该问答知识库上线使用后,在接收到用户的问询文本,可以在问答知识库中查找对应的答案数据,将答案数据返回给用户以回复用户的问询文本。Referring to Figure 1, an application scenario diagram provided by the present application includes raw data of different modalities, such as text, images, or videos. The text data comes from different types of text, such as conversations, comments, and graphs. These raw data are processed to build a question-and-answer knowledge base. After the question-and-answer knowledge base is put online for use, upon receiving the user's inquiry text, the corresponding answer data can be found in the question-and-answer knowledge base, and the answer data can be returned to the user to reply to the user's inquiry text.
以下结合附图,详细说明本申请各实施例提供的技术方案。The technical solutions provided by various embodiments of the present application are described in detail below in conjunction with the accompanying drawings.
图2为本申请示例性实施例提供的一种知识库的构建方法的步骤流程图。可以应 用在服务器,如图2所示该知识库的构建方法,具体包括以下步骤:FIG2 is a flowchart of a method for constructing a knowledge base provided by an exemplary embodiment of the present application. The method for constructing the knowledge base used in the server, as shown in FIG2, specifically includes the following steps:
S201,获取多条原始数据。S201, obtaining multiple pieces of original data.
其中,多条原始数据的模态为文本、图像或视频中的至少两种。The modalities of the multiple pieces of original data are at least two of text, image or video.
在本申请实施例中,可以理解,数据库中包括多条原始数据,这多条原始数据来自不同的数据源,数据源的数据为文本模态、图像模态或者视频模态。其中,文本模态的数据源包括:不同类型的数据源,如用户评论数据源、人人对话数据源、商品属性数据源、商品知识图谱数据源、商品详情页数据源以及商品主页数据源。其中,用户评论数据源包括:用户针对某个商品的质量、商品的快递速度或者商家的服务等的评论文本。人人对话数据源包括:用户与商家人工客服的对话文本;商品属性数据源包括:针对商品属性(如类别、大小、颜色)等的文本描述。商品知识图谱数据源包括:节点表示商品,节点之间的联系表示商品之间的联系的知识图谱。商品详情页数据源为图像上包含有文本内容,该文本内容用于描述商品,可以通过OCR(optical character recognition,光符识别)技术识别出这些文本介绍内容,用于作为原始数据。商品主页数据源为图像上包含有文本内容,该文本内容也用于描述商品,也可以通过OCR技术识别出这些文本内容,用于作为原始数据。In the embodiment of the present application, it can be understood that the database includes multiple pieces of raw data, which come from different data sources, and the data of the data source is text mode, image mode or video mode. Among them, the data source of the text mode includes: different types of data sources, such as user comment data source, person-to-person dialogue data source, product attribute data source, product knowledge graph data source, product details page data source and product homepage data source. Among them, the user comment data source includes: user comment text on the quality of a certain product, the express delivery speed of the product or the service of the merchant. The person-to-person dialogue data source includes: the dialogue text between the user and the merchant's manual customer service; the product attribute data source includes: text descriptions of product attributes (such as category, size, color, etc.). The product knowledge graph data source includes: nodes represent products, and the connections between nodes represent the knowledge graph of the connections between products. The product details page data source contains text content on the image, which is used to describe the product. These text introduction contents can be identified by OCR (optical character recognition) technology and used as raw data. The data source of the product homepage is the text content contained in the image, which is also used to describe the product. The text content can also be recognized by OCR technology and used as original data.
在本申请实施例中,图像如包含商品对象的图像,也可以是包含文本的图像,如上述商品详情页、商品主页等。视频如商品的简介视频、商品的使用方式视频或者商品的安装方式视频等。In the embodiment of the present application, the image may include an image of a product object, or may include an image of text, such as the product details page, product homepage, etc. The video may include a product introduction video, a product usage video, or a product installation video.
此外,一条原始数据,如一条评论文本为一条原始数据,一次人人对话为一条原始数据,一段商品属性为一条原始数据,一个知识图谱为一条原始数据,一个图像中识别出的全部文本为一条原始数据,一个图像为一条原始数据,一个视频为一个原始数据。In addition, a piece of raw data, such as a comment text is a piece of raw data, a person-to-person conversation is a piece of raw data, a product attribute is a piece of raw data, a knowledge graph is a piece of raw data, all the text recognized in an image is a piece of raw data, an image is a piece of raw data, and a video is a piece of raw data.
在本申请实施例中,对原始数据的来源不加以限定。In the embodiments of the present application, the source of the original data is not limited.
S202,针对多条原始数据中的原始数据,确定原始数据的关键文本,并根据关键文本确定键值对的键,根据原始数据确定键值对的值。S202, determining key text of the original data among the plurality of original data, determining a key of a key-value pair according to the key text, and determining a value of the key-value pair according to the original data.
在本申请实施例中,一条原始数据对应可以确定至少一个关键文本。其中,关键文本用于描述原始数据的内容。确定原始数据的关键文本,包括:将原始数据转化成自然语言文本;对自然语言文本进行关键文本的抽取,得到关键文本,关键文本包括:原始数据的内容主题、核心观点、关键词、关键实体中的至少一项。In an embodiment of the present application, at least one key text can be determined corresponding to a piece of raw data. The key text is used to describe the content of the raw data. Determining the key text of the raw data includes: converting the raw data into natural language text; extracting the key text from the natural language text to obtain the key text, and the key text includes: at least one of the content theme, core viewpoint, keyword, and key entity of the raw data.
其中,若原始数据为文本,则将原始数据转化成自然语言文本包括:原始数据通过prompt(一种自然语言模型)模版处理,得到自然语言文本。该prompt模版是预先训练的,可以将原始数据转化成自然语言文本。若原始数据为图像或者视频(包括多帧图像),则可以将图像或者视频输入预先训练的多模态识别模型中进行处理,可以得到描述对应图像或视频的自然语言文本。Wherein, if the original data is text, converting the original data into natural language text includes: processing the original data through a prompt (a natural language model) template to obtain natural language text. The prompt template is pre-trained and can convert the original data into natural language text. If the original data is an image or video (including multiple frames of images), the image or video can be input into a pre-trained multimodal recognition model for processing, and a natural language text describing the corresponding image or video can be obtained.
其中,在本申请实施例中,对自然语言文本进行关键文本的抽取,得到关键文本, 包括:将自然语言文本输入统一模型进行关键文本的抽取,得到关键文本。In the embodiment of the present application, key text is extracted from the natural language text to obtain the key text. It includes: inputting natural language text into a unified model to extract key text and obtain key text.
在本申请实施例中,统一模型包括多个预先训练的子模型,能够实现多个NLP(自然语言)任务的统一,如匹配子模型、分类子模型或者排序子模型等。其中匹配子模型用于实现匹配任务,分类子模型用于实现分类任务,排序子模型用于实现排序任务。将自然语言文本输入该统一模型,通过各个子模型的处理,能够得到至少一个关键文本。In the embodiment of the present application, the unified model includes multiple pre-trained sub-models, which can realize the unification of multiple NLP (natural language) tasks, such as matching sub-models, classification sub-models or sorting sub-models. The matching sub-model is used to realize the matching task, the classification sub-model is used to realize the classification task, and the sorting sub-model is used to realize the sorting task. The natural language text is input into the unified model, and at least one key text can be obtained through the processing of each sub-model.
示例性地,通过统一模型可以抽取到原始数据的内容主题、核心观点、关键词、关键实体等核心内容。其中,内容主题,如原始数据是针对商品的或者质量的,核心观点如原始数据是正向数据或负向数据,关键词如原始数据中描述内容主题属性的关键词,关键实体如地点、时间、商品名称或商品类目等。For example, the unified model can extract the core contents of the original data, such as content themes, core ideas, keywords, key entities, etc. Among them, content themes, such as whether the original data is about goods or quality, core ideas, such as whether the original data is positive data or negative data, keywords, such as keywords in the original data that describe the content theme attributes, and key entities, such as location, time, product name or product category, etc.
一种可选实施例中,可以将关键文本作为键值对的键,将自然语言文本或原始数据作为键值对的值,得到键值对,进而构建问答知识库。其中,一条原始数据对应一条自然语言文本,一条自然语言文本可对应一个键,该键为原始数据的关键文本中的一个关键文本或者多个关键文本的组合。In an optional embodiment, the key text can be used as the key of the key-value pair, and the natural language text or original data can be used as the value of the key-value pair to obtain the key-value pair, thereby constructing a question-answering knowledge base. Among them, one piece of original data corresponds to one piece of natural language text, and one piece of natural language text can correspond to one key, which is one key text in the key text of the original data or a combination of multiple key texts.
示例性地,参照图3,原始数据为X,原始数据对应的自然文本为N,原始数据对应的关键文本有主题A、核心观点B、关键词C、关键词D、关键实体E和关键实体F。则构建的问答知识库中包括的键值对如图3。For example, referring to FIG3 , the original data is X, the natural text corresponding to the original data is N, and the key text corresponding to the original data includes topic A, core view B, keyword C, keyword D, key entity E and key entity F. The key-value pairs included in the constructed question-answering knowledge base are shown in FIG3 .
进一步地,根据原始数据确定键值对的值,包括:基于关键文本对原始数据进行切分,得到关键文本对应的数据片段,数据片段的模态与原始数据的模态相同;根据数据片段,确定键值对的值。Furthermore, the value of the key-value pair is determined based on the original data, including: segmenting the original data based on the key text to obtain data segments corresponding to the key text, the mode of the data segments is the same as the mode of the original data; and determining the value of the key-value pair based on the data segments.
此外,原始数据为文本,则基于关键文本对原始数据进行切分,得到关键文本对应的数据片段,包括:采用机器阅读理解技术(MRC,Machine Reading Comprehension),在原始数据中提取描述关键文本的数据片段。In addition, if the original data is text, the original data is segmented based on the key text to obtain data segments corresponding to the key text, including: using machine reading comprehension technology (MRC, Machine Reading Comprehension) to extract data segments describing the key text from the original data.
具体地,是将关键文本和对应的原始数据输入该基于机器阅读理解技术的理解模型中,输出描述关键文本的数据片段。其中,理解模型可以去掉原始数据中的冗余噪声数据,进而能够提高数据片段的整体质量。在本申请实施例中,一关键文本在原始数据中具有对应的数据片段。Specifically, the key text and the corresponding raw data are input into the understanding model based on machine reading comprehension technology, and the data fragments describing the key text are output. Among them, the understanding model can remove the redundant noise data in the raw data, thereby improving the overall quality of the data fragments. In the embodiment of the present application, a key text has a corresponding data fragment in the raw data.
示例性地,若原始数据为“这个投影仪的参数为机体重量1kg,变焦倍数为定焦,光源类型为LED光源”,得到的关键文本有:投影仪、重量、变焦倍数、定焦、光源类型、LED光源。则“投影仪”对应的数据片段可以是整个原始数据,重量对应的数据片段可以是“机体重量1kg”,变焦倍数和/或定焦对应的数据片段为变焦倍数为定焦,光源类型和/或LED光源对应的数据片段为光源类型为LED光源。For example, if the original data is "the parameters of this projector are body weight 1kg, zoom multiple is fixed focus, and light source type is LED light source", the key texts obtained are: projector, weight, zoom multiple, fixed focus, light source type, LED light source. Then the data segment corresponding to "projector" can be the entire original data, the data segment corresponding to weight can be "body weight 1kg", the data segment corresponding to zoom multiple and/or fixed focus is zoom multiple is fixed focus, and the data segment corresponding to light source type and/or LED light source is light source type is LED light source.
在本申请实施例中,若原始数据为图像,则可以将整个图像作为键值对的值,也可以对原始数据进行切分,切分得到的局部图像为数据片段作为键值对的值。切分方式可以通过图像识别或者图像处理技术。例如,一张图像中包含的对象为一套餐具, 关键文本为“勺子”,则可以将该图像中包含勺子的局部图像作为该“勺子”的数据片段。In the embodiment of the present application, if the original data is an image, the entire image can be used as the value of the key-value pair, or the original data can be segmented, and the local image obtained by segmentation is the data fragment as the value of the key-value pair. The segmentation method can be through image recognition or image processing technology. For example, the object contained in an image is a set of tableware. If the key text is “spoon”, the partial image containing the spoon in the image can be used as the data segment of the “spoon”.
进一步地,对于原始数据为视频,则可以将整个视频作为键值对的值,也可以将视频中的部分帧图像作为键值对的值,也可以将视频中的图像中局部图像作为键值对的值。在本申请实施例中,对图像和视频的具体切分方法不加以限定。Furthermore, if the original data is a video, the entire video can be used as the value of the key-value pair, or a portion of the frame image in the video can be used as the value of the key-value pair, or a local image in the image in the video can be used as the value of the key-value pair. In the embodiment of the present application, the specific segmentation method of the image and video is not limited.
进一步地,根据关键文本确定键值对的键,包括:对关键文本进行扩充,得到第一问询文本;确定第一问询文本为键值对的键。Further, determining the key of the key-value pair according to the key text includes: expanding the key text to obtain a first inquiry text; and determining the first inquiry text as the key of the key-value pair.
在本申请实施例中,可以对一个或者多个关键文本进行扩充,得到第一问询文本。例如,对于原始数据X,对应的关键文本有主题A、核心观点B、关键词C、关键词D、关键实体E和关键实体F。则可以对主题A进行扩充,得到对应的第一问询文本;对主题A和核心观点B进行扩充,得到对应的第一问询文本。其中,一个原始数据可以对应得到多个第一问询文本。将该第一问询文本确定为键值对的键,即为索引。In an embodiment of the present application, one or more key texts can be expanded to obtain a first inquiry text. For example, for the original data X, the corresponding key texts include topic A, core view B, keyword C, keyword D, key entity E, and key entity F. Then topic A can be expanded to obtain the corresponding first inquiry text; topic A and core view B can be expanded to obtain the corresponding first inquiry text. Among them, one original data can correspond to multiple first inquiry texts. The first inquiry text is determined as the key of the key-value pair, which is the index.
本申请中,对关键文本的扩充可以将关键文本输入预先训练的扩充模型中扩充,使关键文本扩充为一自然语言的问询文本。例如,若主题A为投影仪,关键词C为重量,则扩充后的第一问询文本可以为“请问投影仪的重量是多少”。可以理解,扩充后的第一问询文本中包括该关键文本。In the present application, the key text may be expanded by inputting the key text into a pre-trained expansion model, so that the key text is expanded into a natural language inquiry text. For example, if the subject A is a projector and the keyword C is weight, the expanded first inquiry text may be "How much does the projector weigh?" It can be understood that the expanded first inquiry text includes the key text.
进一步地,对关键文本进行扩充,得到问询文本之后,还包括:编码第一问询文本,得到编码向量;确定编码向量为键值对中的键。Furthermore, after expanding the key text to obtain the query text, the method further includes: encoding the first query text to obtain a coding vector; and determining the coding vector as a key in the key-value pair.
在本申请实施例中,编码第一问询文本采用预先训练的BERT语言模型的编码器,将第一问询文本输入该编码器进行编码,输出的即为该第一问询文本的编码向量。In an embodiment of the present application, the first inquiry text is encoded using an encoder of a pre-trained BERT language model, the first inquiry text is input into the encoder for encoding, and the output is the encoding vector of the first inquiry text.
可以理解,将编码向量作为键值对中的键(即索引)能够有效弥补问询文本索引对语义内容召回效果差的缺陷。It can be understood that using the encoding vector as the key (ie, index) in the key-value pair can effectively make up for the defect that the query text index has a poor recall effect on semantic content.
S203,根据多条原始数据对应的键值对,构建问答知识库。S203, constructing a question-answer knowledge base according to the key-value pairs corresponding to the multiple pieces of original data.
进一步地,还包括:对原始数据进行数据挖掘,得到目标数据;根据目标数据,生成目标数据的第二问询文本;确定目标数据为键值对的值以及确定第二问询文本为键值对的键,以构建问答知识库。Furthermore, it also includes: performing data mining on the original data to obtain target data; generating a second query text of the target data based on the target data; determining the target data as the value of the key-value pair and determining the second query text as the key of the key-value pair to construct a question-and-answer knowledge base.
在本申请实施例中,可以将原始数据挖掘的数据组成键值对加入上述构建的问答知识库中,提高问答知识库中的数据量。示例性地,对用户评论数据进行挖掘,得到目标数据为评论数据,多条评论数据构成评论知识库。对商品属性数据进行挖掘,得到的目标数据为商品数据,多条商品数据构成商品知识库。对其他数据进行挖掘,得到的目标数据为通用数据,多条通用数据构成通用知识库,针对评论知识库、商品知识库和通用知识库中的目标数据,确定该目标数据的键,并将该目标数据作为值,可以得到键值对加入上述方式构建的问答知识库中。In an embodiment of the present application, the data mined from the original data can be combined into key-value pairs and added to the question-answer knowledge base constructed as described above, thereby increasing the amount of data in the question-answer knowledge base. Exemplarily, user comment data is mined, and the target data obtained is comment data, and multiple comment data constitute the comment knowledge base. Product attribute data is mined, and the target data obtained is product data, and multiple product data constitute the product knowledge base. Other data is mined, and the target data obtained is general data, and multiple general data constitute the general knowledge base. For the target data in the comment knowledge base, product knowledge base, and general knowledge base, the key of the target data is determined, and the target data is used as the value, so that the key-value pair can be obtained and added to the question-answer knowledge base constructed in the above manner.
示例性地,参照图4,为本申请得到的问答知识库的示意图,其中,该问答知识库的键可以是文本或者编码向量,但是该键对应的值可以是文本、图像或者视频。例 如,在图4中,编码向量b是问询文本a的编码向量,二者对应同一值为文本g。编码向量d是问询文本c的编码向量,二者对应同一值为图像h。编码向量f是问询文本e的编码向量,二者对应同一值为视频k。For example, referring to FIG4 , which is a schematic diagram of a question-answering knowledge base obtained in the present application, the key of the question-answering knowledge base may be a text or a coded vector, but the value corresponding to the key may be a text, an image or a video. For example, in FIG4 , the encoding vector b is the encoding vector of the query text a, and the corresponding value of the two is the text g. The encoding vector d is the encoding vector of the query text c, and the corresponding value of the two is the image h. The encoding vector f is the encoding vector of the query text e, and the corresponding value of the two is the video k.
本申请提供的问答知识库的构建方法能够解决不同模态的答案数据的统一检索问题,同时具备较强的可扩展性,能够较好解决相关技术中问答知识库的构建方法不够灵活的问题,本申请能够较快速构建亿级别的问答知识库,提高了对线上问询数据的检索覆盖率。The method for constructing a question-and-answer knowledge base provided in this application can solve the problem of unified retrieval of answer data of different modalities. At the same time, it has strong scalability and can better solve the problem that the method for constructing a question-and-answer knowledge base in related technologies is not flexible enough. This application can quickly construct a question-and-answer knowledge base of 100 million levels, thereby improving the retrieval coverage of online inquiry data.
图5为本申请示例性实施例提供的一种数据检索方法的步骤流程图,应用于服务器。具体包括以下步骤:FIG5 is a flowchart of a data retrieval method provided by an exemplary embodiment of the present application, which is applied to a server. Specifically, the following steps are included:
S501,接收终端设备发送的问询数据。S501, receiving inquiry data sent by a terminal device.
其中,问询数据为文本、图像或视频中的一个。若是图像或视频,则可以将该图像或视频转换为问询文本,该问询文本可以描述该图像或视频的内容。The query data is one of text, image or video. If it is an image or video, the image or video can be converted into a query text, and the query text can describe the content of the image or video.
S502,在问答知识库中,检索问询数据对应的答案数据。S502: Retrieve answer data corresponding to the query data in the question-answer knowledge base.
其中,答案数据的模态为文本、图像或者视频中的一个,问答知识库是根据上述的知识库的构建方法构建的。The mode of the answer data is one of text, image or video, and the question-answer knowledge base is constructed according to the above-mentioned knowledge base construction method.
进一步地,在问答知识库中,检索问询数据对应的答案数据,包括:确定基于问询数据确定的键的键值对中的值为答案数据;和/或,对基于问询数据进行编码,得到问询编码向量;在问答知识库中,确定与编码向量的相似度大于阈值的目标编码向量;在问答知识库中,确定目标编码向量为键的键值对的值为答案数据。Furthermore, in the question-and-answer knowledge base, answer data corresponding to the query data is retrieved, including: determining that the value in a key-value pair of a key determined based on the query data is the answer data; and/or encoding the query data to obtain a query encoding vector; in the question-and-answer knowledge base, determining a target encoding vector whose similarity with the encoding vector is greater than a threshold; in the question-and-answer knowledge base, determining that the value of the key-value pair with the target encoding vector as the key is the answer data.
具体地,确定基于问询数据确定的键的键值对中的值为答案数据包括:若终端设备发送的问询数据为文本,则可以以该问询数据为键在问答知识库中检索对应的值为答案数据,若无法检索到对应的值,则将该问询数据进行编码得到编码向量,以该编码向量为键在问答知识库中检索对应的值为答案数据。Specifically, determining that the value in the key-value pair of the key determined based on the query data is the answer data includes: if the query data sent by the terminal device is text, the corresponding value can be retrieved as the answer data in the question and answer knowledge base using the query data as the key; if the corresponding value cannot be retrieved, the query data is encoded to obtain a coding vector, and the corresponding value is retrieved as the answer data in the question and answer knowledge base using the coding vector as the key.
进一步地,若问询数据为图像或视频,则可以将图像或视频进行提取处理,得到对应的问询文本,然后将该问询文本为键或该问询文本的编码向量为键检索对应的值为答案数据。Furthermore, if the query data is an image or video, the image or video can be extracted and processed to obtain the corresponding query text, and then the query text is used as a key or the encoding vector of the query text is used as a key to retrieve the corresponding value as the answer data.
S503,向终端设备发送答案数据。S503, sending answer data to the terminal device.
在本申请实施例中,服务器可以依据知识库检索到问询数据的答案数据,以给用户提供高质量的答案。In an embodiment of the present application, the server can retrieve answer data for the query data based on the knowledge base to provide high-quality answers to the user.
图6为本申请示例性实施例提供的另一种数据检索方法的步骤流程图。应用于终端设备,具体包括以下步骤:FIG6 is a flowchart of another data retrieval method provided by an exemplary embodiment of the present application. Applied to a terminal device, the method specifically includes the following steps:
S601,向服务器发送问询数据。S601, sending inquiry data to the server.
其中,问询数据为文本、图像或视频中的一个。The query data is one of text, image or video.
S602,接收服务器发送的问询数据的答案数据。S602, receiving answer data of the inquiry data sent by the server.
其中,答案数据是根据上述的数据检索方法确定。 The answer data is determined according to the above-mentioned data retrieval method.
该实施例的具体实现过程参照上述实施例,在此不再赘述。The specific implementation process of this embodiment refers to the above embodiment and will not be repeated here.
在本申请实施例中,参照图7,除了提供知识库的构建方法之外,还提供一种知识库的构建装置70,该知识库的构建装置70包括:In the embodiment of the present application, referring to FIG. 7 , in addition to providing a method for constructing a knowledge base, a knowledge base construction device 70 is also provided. The knowledge base construction device 70 includes:
获取模块71,用于获取多条原始数据,多条原始数据的模态为文本、图像或视频中的至少两种;An acquisition module 71 is used to acquire multiple pieces of original data, where the modalities of the multiple pieces of original data are at least two of text, image or video;
确定模块72,用于针对多条原始数据中的原始数据,确定原始数据的关键文本,并根据关键文本确定键值对的键,根据原始数据确定键值对的值,关键文本用于描述原始数据的内容;A determination module 72, for determining key text of the original data among the plurality of original data, and determining a key of a key-value pair according to the key text, and determining a value of the key-value pair according to the original data, wherein the key text is used to describe the content of the original data;
构建模块73,用于根据多条原始数据对应的键值对,构建问答知识库。The construction module 73 is used to construct a question-answer knowledge base according to the key-value pairs corresponding to the multiple original data.
一种可选实施例中,确定模块72具体用于:将原始数据转化成自然语言文本;对自然语言文本进行关键文本的抽取,得到关键文本,关键文本包括:原始数据的内容主题、核心观点、关键词、关键实体中的至少一项。In an optional embodiment, the determination module 72 is specifically used to: convert the original data into natural language text; extract key text from the natural language text to obtain key text, and the key text includes: at least one of the content theme, core ideas, keywords, and key entities of the original data.
在一可选实施例中,确定模块72在对自然语言文本进行关键文本的抽取,得到关键文本时,具体用于:将自然语言文本输入统一模型进行关键文本的抽取,得到关键文本。In an optional embodiment, when extracting key text from natural language text to obtain key text, the determination module 72 is specifically used to: input the natural language text into a unified model to extract key text to obtain key text.
在一可选实施例中,确定模块72具体用于:基于关键文本对原始数据进行切分,得到关键文本对应的数据片段,数据片段的模态与原始数据的模态相同;根据数据片段,确定键值对的值。In an optional embodiment, the determination module 72 is specifically used to: segment the original data based on the key text to obtain data segments corresponding to the key text, and the mode of the data segments is the same as the mode of the original data; determine the value of the key-value pair based on the data segments.
在一可选实施例中,原始数据为文本,则确定模块72在基于关键文本对原始数据进行切分,得到关键文本对应的数据片段,具体用于:采用机器阅读理解技术,在原始数据中提取描述关键文本的数据片段。In an optional embodiment, the original data is text, and the determination module 72 segments the original data based on the key text to obtain data segments corresponding to the key text, specifically for: using machine reading comprehension technology to extract data segments describing the key text from the original data.
在一可选实施例中,确定模块72具体用于:对关键文本进行扩充,得到第一问询文本;确定第一问询文本为键值对的键。In an optional embodiment, the determination module 72 is specifically used to: expand the key text to obtain a first query text; and determine that the first query text is a key of the key-value pair.
在一可选实施例中,确定模块72还用于,编码第一问询文本,得到编码向量;确定编码向量为键值对中的键。In an optional embodiment, the determination module 72 is further used to encode the first query text to obtain an encoding vector; and determine the encoding vector as a key in the key-value pair.
在一可选实施例中,确定模块72还用于,对原始数据进行数据挖掘,得到目标数据;根据目标数据,生成目标数据的第二问询文本;确定目标数据为键值对的值以及确定第二问询文本为键值对的键,以构建问答知识库。In an optional embodiment, the determination module 72 is also used to perform data mining on the original data to obtain target data; generate a second query text for the target data based on the target data; determine the target data as the value of the key-value pair and determine the second query text as the key of the key-value pair to construct a question-and-answer knowledge base.
在本申请实施例中,还提供一种数据检索装置(未示出),该数据检索装置包括:In an embodiment of the present application, a data retrieval device (not shown) is also provided, and the data retrieval device includes:
接收模块,用于接收终端设备发送的问询数据,问询数据为文本、图像或视频中的一个;A receiving module, used to receive inquiry data sent by a terminal device, where the inquiry data is one of text, image or video;
检索模块,用于在问答知识库中,检索问询数据对应的答案数据,答案数据的模态为文本、图像或者视频中的一个,问答知识库是根据上述的知识库的构建方法构建的;A retrieval module is used to retrieve answer data corresponding to the query data in the question-answer knowledge base, where the mode of the answer data is one of text, image or video. The question-answer knowledge base is constructed according to the above-mentioned knowledge base construction method;
发送模块,用于向终端设备发送答案数据。 The sending module is used to send answer data to the terminal device.
在一可选实施例中,检索模块具体用于:确定基于问询数据确定的键的键值对中的值为答案数据;In an optional embodiment, the retrieval module is specifically used to: determine that a value in a key-value pair of a key determined based on the query data is answer data;
和/或,对基于问询数据进行编码,得到问询编码向量;在问答知识库中,确定与编码向量的相似度大于阈值的目标编码向量;在问答知识库中,确定目标编码向量为键的键值对的值为答案数据。And/or, encode the query data to obtain a query encoding vector; in the question-answering knowledge base, determine a target encoding vector whose similarity with the encoding vector is greater than a threshold; in the question-answering knowledge base, determine that the value of the key-value pair with the target encoding vector as the key is the answer data.
在本申请实施例中,还提供另一种数据检索装置(未示出),应用于终端设备,该数据检索装置包括:In an embodiment of the present application, another data retrieval device (not shown) is further provided and applied to a terminal device. The data retrieval device includes:
发送模块,用于向服务器发送问询数据;A sending module, used for sending query data to the server;
接收模块,用于接收服务器发送的问询数据的答案数据,答案数据是上述的数据检索方法确定。The receiving module is used to receive answer data of the inquiry data sent by the server, and the answer data is determined by the above-mentioned data retrieval method.
在本申请实施例中,提供一种知识库的构建装置,应用于智能客服场景中,通过获取多条原始数据,多条原始数据的模态为文本、图像或视频中的至少两种;针对多条原始数据中的原始数据,确定原始数据的关键文本,并根据关键文本确定键值对的键,根据原始数据确定键值对的值,关键文本用于描述原始数据的内容;根据多条原始数据对应的键值对,构建问答知识库,能够构建一个支持不同模态的答案检索的问答知识库。In an embodiment of the present application, a knowledge base construction device is provided, which is applied to an intelligent customer service scenario, by obtaining multiple pieces of original data, the modalities of the multiple pieces of original data are at least two of text, image or video; for the original data in the multiple pieces of original data, the key text of the original data is determined, and the key of the key-value pair is determined according to the key text, and the value of the key-value pair is determined according to the original data, and the key text is used to describe the content of the original data; according to the key-value pairs corresponding to the multiple pieces of original data, a question-and-answer knowledge base is constructed, and a question-and-answer knowledge base that supports answer retrieval in different modalities can be constructed.
另外,在上述实施例及附图中的描述的一些流程中,包含了按照特定顺序出现的多个操作,但是应该清楚了解,这些操作可以不按照其在本文中出现的顺序来执行或并行执行,仅仅是用于区分开各个不同的操作,序号本身不代表任何的执行顺序。另外,这些流程可以包括更多或更少的操作,并且这些操作可以按顺序执行或并行执行。需要说明的是,本文中的“第一”、“第二”等描述,是用于区分不同的消息、设备、模块等,不代表先后顺序,也不限定“第一”和“第二”是不同的类型。In addition, in some of the processes described in the above embodiments and the accompanying drawings, multiple operations that appear in a specific order are included, but it should be clearly understood that these operations may not be executed in the order in which they appear in this article or executed in parallel, and are only used to distinguish between different operations, and the sequence number itself does not represent any execution order. In addition, these processes may include more or fewer operations, and these operations may be executed in sequence or in parallel. It should be noted that the descriptions of "first", "second", etc. in this article are used to distinguish different messages, devices, modules, etc., do not represent the order of precedence, and do not limit the "first" and "second" to be different types.
图8为本申请示例性实施例提供的一种云设备80的结构示意图。该云设备80用于运行上述知识库的构建方法或图像处理方法。如图8所示,该云设备包括:存储器84和处理器85。FIG8 is a schematic diagram of the structure of a cloud device 80 provided by an exemplary embodiment of the present application. The cloud device 80 is used to run the above-mentioned knowledge base construction method or image processing method. As shown in FIG8 , the cloud device includes: a memory 84 and a processor 85 .
存储器84,用于存储计算机程序,并可被配置为存储其它各种信息以支持在云设备上的操作。该存储器84可以是对象存储(Object Storage Service,OSS)。The memory 84 is used to store computer programs and can be configured to store various other information to support operations on the cloud device. The memory 84 can be an object storage service (OSS).
存储器84可以由任何类型的易失性或非易失性存储设备或者它们的组合实现,如静态随机存取存储器(SRAM),电可擦除可编程只读存储器(EEPROM),可擦除可编程只读存储器(EPROM),可编程只读存储器(PROM),只读存储器(ROM),磁存储器,快闪存储器,磁盘或光盘。The memory 84 can be implemented by any type of volatile or nonvolatile storage device or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic disk or optical disk.
处理器85,与存储器84耦合,用于执行存储器84中的计算机程序,以用于:获取多条原始数据,多条原始数据的模态为文本、图像或视频中的至少两种;针对多条原始数据中的原始数据,确定原始数据的关键文本,并根据关键文本确定键值对的键,根据原始数据确定键值对的值,关键文本用于描述原始数据的内容;根据多条原始数 据对应的键值对,构建问答知识库。The processor 85 is coupled to the memory 84 and is used to execute the computer program in the memory 84 to: obtain multiple pieces of original data, where the modalities of the multiple pieces of original data are at least two of text, image, or video; determine key text of the original data for the original data in the multiple pieces of original data, determine the key of the key-value pair according to the key text, and determine the value of the key-value pair according to the original data, where the key text is used to describe the content of the original data; and determine the value of the key-value pair according to the multiple pieces of original data. According to the corresponding key-value pairs, a question-answer knowledge base is constructed.
进一步可选地,处理器85在确定原始数据的关键文本时,具体用于:将原始数据转化成自然语言文本;对自然语言文本进行关键文本的抽取,得到关键文本,关键文本包括:原始数据的内容主题、核心观点、关键词、关键实体中的至少一项。Further optionally, when determining the key text of the original data, the processor 85 is specifically used to: convert the original data into natural language text; extract the key text from the natural language text to obtain the key text, and the key text includes: at least one of the content theme, core ideas, keywords, and key entities of the original data.
进一步可选地,处理器85在对自然语言文本进行关键文本的抽取,得到关键文本时,具体用于:将自然语言文本输入统一模型进行关键文本的抽取,得到关键文本。Further optionally, when the processor 85 extracts key text from the natural language text to obtain the key text, it is specifically used to: input the natural language text into a unified model to extract the key text to obtain the key text.
进一步可选地,处理器85在根据原始数据确定键值对的值时,具体用于:基于关键文本对原始数据进行切分,得到关键文本对应的数据片段,数据片段的模态与原始数据的模态相同;根据数据片段,确定键值对的值。Further optionally, when determining the value of the key-value pair based on the original data, the processor 85 is specifically used to: segment the original data based on the key text to obtain data segments corresponding to the key text, the mode of the data segments is the same as the mode of the original data; and determine the value of the key-value pair based on the data segments.
进一步可选地,原始数据为文本,处理器85在基于关键文本对原始数据进行切分,得到关键文本对应的数据片段时,具体用于:采用机器阅读理解技术,在原始数据中提取描述关键文本的数据片段。Further optionally, the original data is text, and when the processor 85 segments the original data based on the key text to obtain data segments corresponding to the key text, it is specifically used to: use machine reading comprehension technology to extract data segments describing the key text from the original data.
进一步可选地,处理器85在对关键文本进行扩充,得到问询文本之后,还用于:编码第一问询文本,得到编码向量;确定编码向量为键值对中的键。Further optionally, after expanding the key text to obtain the query text, the processor 85 is further used to: encode the first query text to obtain a coding vector; and determine the coding vector as a key in the key-value pair.
进一步可选地,处理器85还用于对原始数据进行数据挖掘,得到目标数据;根据目标数据,生成目标数据的第二问询文本;确定目标数据为键值对的值以及确定第二问询文本为键值对的键,以构建问答知识库。Further optionally, the processor 85 is also used to perform data mining on the original data to obtain target data; generate a second query text of the target data based on the target data; determine the target data as the value of the key-value pair and determine the second query text as the key of the key-value pair to construct a question and answer knowledge base.
在一可选实施例中,处理器85,与存储器84耦合,用于执行存储器84中的计算机程序,以用于:接收终端设备发送的问询数据,问询数据为文本、图像或视频中的一个;在问答知识库中,检索问询数据对应的答案数据,答案数据的模态为文本、图像或者视频中的一个,问答知识库是根据上述任一项的知识库的构建方法构建的;向终端设备发送答案数据。In an optional embodiment, the processor 85 is coupled to the memory 84, and is used to execute the computer program in the memory 84, so as to: receive query data sent by a terminal device, the query data being one of text, image or video; retrieve answer data corresponding to the query data in a question and answer knowledge base, the modality of the answer data being one of text, image or video, and the question and answer knowledge base is constructed according to any of the above-mentioned knowledge base construction methods; and send the answer data to the terminal device.
进一步可选地,处理器85在问答知识库中,检索问询数据对应的答案数据时,具体用于:确定基于问询数据确定的键的键值对中的值为答案数据;和/或,对基于问询数据进行编码,得到问询编码向量;在问答知识库中,确定与编码向量的相似度大于阈值的目标编码向量。Further optionally, when the processor 85 retrieves answer data corresponding to the query data in the question and answer knowledge base, it is specifically used to: determine that the value in the key-value pair of the key determined based on the query data is the answer data; and/or, encode the query data to obtain a query encoding vector; determine in the question and answer knowledge base a target encoding vector whose similarity with the encoding vector is greater than a threshold.
在一可选实施例中,处理器85,与存储器84耦合,用于执行存储器84中的计算机程序,以用于:向服务器发送问询数据;接收服务器发送的问询数据的答案数据,答案数据是根据上述的数据检索方法确定。In an optional embodiment, the processor 85 is coupled to the memory 84 and is used to execute the computer program in the memory 84 to: send query data to the server; receive answer data of the query data sent by the server, and the answer data is determined according to the above-mentioned data retrieval method.
进一步地,如图8,该云设备还包括:防火墙81、负载均衡器82、通信组件86、电源组件83等其它组件。图8中仅示意性给出部分组件,并不意味着云设备只包括图8所示组件。Furthermore, as shown in Fig. 8 , the cloud device also includes other components such as a firewall 81, a load balancer 82, a communication component 86, and a power supply component 83. Fig. 8 only schematically shows some components, which does not mean that the cloud device only includes the components shown in Fig. 8 .
本申请实施例提供的云设备,能够实现得到压缩后的视觉网络模型,该压缩后的视觉网络模型在不影响识别精度的情况下,占有较小的内存以及具有较快的计算效率。The cloud device provided in the embodiment of the present application can obtain a compressed visual network model, which occupies a smaller memory and has a faster computing efficiency without affecting the recognition accuracy.
相应地,本申请实施例还提供一种存储有计算机程序的计算机可读存储介质,当 计算机程序/指令被处理器执行时,致使处理器实现上述所示方法中的步骤。Accordingly, the embodiment of the present application also provides a computer-readable storage medium storing a computer program. When the computer program/instructions are executed by a processor, the processor is caused to implement the steps in the above-mentioned method.
相应地,本申请实施例还提供一种计算机程序产品,包括计算机程序/指令,当计算机程序/指令被处理器执行时,致使处理器实现上述所示方法中的步骤。Accordingly, an embodiment of the present application also provides a computer program product, including a computer program/instruction. When the computer program/instruction is executed by a processor, the processor is caused to implement the steps in the method shown above.
上述图8的通信组件被配置为便于通信组件所在设备和其他设备之间有线或无线方式的通信。通信组件所在设备可以接入基于通信标准的无线网络,如WiFi,2G、3G、4G/LTE、5G等移动通信网络,或它们的组合。在一个示例性实施例中,通信组件经由广播信道接收来自外部广播管理系统的广播信号或广播相关文本。在一个示例性实施例中,通信组件还包括近场通信(NFC)模块,以促进短程通信。例如,在NFC模块可基于射频识别(RFID)技术,红外信息协会(IrDA)技术,超宽带(UWB)技术,蓝牙(BT)技术和其他技术来实现。The communication component of Figure 8 above is configured to facilitate wired or wireless communication between the device where the communication component is located and other devices. The device where the communication component is located can access a wireless network based on a communication standard, such as WiFi, 2G, 3G, 4G/LTE, 5G and other mobile communication networks, or a combination thereof. In an exemplary embodiment, the communication component receives a broadcast signal or broadcast-related text from an external broadcast management system via a broadcast channel. In an exemplary embodiment, the communication component also includes a near field communication (NFC) module to facilitate short-range communication. For example, the NFC module can be implemented based on radio frequency identification (RFID) technology, infrared information association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology and other technologies.
上述图8的电源组件,为电源组件所在设备的各种组件提供电力。电源组件可以包括电源管理系统,一个或多个电源,及其他与为电源组件所在设备生成、管理和分配电力相关联的组件。The power supply assembly of Figure 8 provides power to various components of the device in which the power supply assembly is located. The power supply assembly may include a power management system, one or more power supplies, and other components associated with generating, managing, and distributing power to the device in which the power supply assembly is located.
在本申请所提供的几个实施例中,应该理解到,所揭露的系统和方法,可以通过其它的方式实现。例如,以上所描述的系统实施例仅仅是示意性的,例如,单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,系统或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided in the present application, it should be understood that the disclosed systems and methods can be implemented in other ways. For example, the system embodiments described above are only schematic, for example, the division of units is only a logical function division, and there may be other division methods in actual implementation, such as multiple units or components can be combined or integrated into another system, or some features can be ignored or not executed. Another point is that the mutual coupling or direct coupling or communication connection shown or discussed can be through some interfaces, indirect coupling or communication connection of systems or units, which can be electrical, mechanical or other forms.
作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and the components shown as units may or may not be physical units, that is, they may be located in one place or distributed on multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用硬件加软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically separately, or two or more units may be integrated into one unit. The above-mentioned integrated unit may be implemented in the form of hardware or in the form of hardware plus software functional units.
上述以软件功能单元的形式实现的集成的单元,可以存储在一个计算机可读取存储介质中。上述软件功能单元存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)或处理器(processor)执行本申请各个实施例方法的部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(Read-Only Memory,ROM)、随机存取存储器(Random Access Memory,RAM)、磁碟或者光盘等各种可以存储程序代码的介质。The above-mentioned integrated unit implemented in the form of a software functional unit can be stored in a computer-readable storage medium. The above-mentioned software functional unit is stored in a storage medium, including a number of instructions for a computer device (which can be a personal computer, a server, or a network device, etc.) or a processor (processor) to execute some steps of the methods of each embodiment of the present application. The aforementioned storage medium includes: U disk, mobile hard disk, read-only memory (ROM), random access memory (RAM), disk or optical disk and other media that can store program codes.
本领域技术人员可以清楚地了解到,为描述的方便和简洁,仅以上述各功能模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能模块完成,即将系统的内部结构划分成不同的功能模块,以完成以上描述的全部或者部 分功能。上述描述的系统的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Those skilled in the art can clearly understand that for the convenience and simplicity of description, only the division of the above functional modules is used as an example for illustration. In actual applications, the above functions can be assigned to different functional modules as needed, that is, the internal structure of the system can be divided into different functional modules to complete all or part of the above description. The specific working process of the system described above can refer to the corresponding process in the aforementioned method embodiment, and will not be repeated here.
本领域技术人员在考虑说明书及实践这里公开的发明后,将容易想到本申请的其它实施方案。本申请旨在涵盖本申请的任何变型、用途或者适应性变化,这些变型、用途或者适应性变化遵循本申请的一般性原理并包括本申请未公开的本技术领域中的公知常识或惯用技术手段。说明书和实施例仅被视为示例性的,本申请的真正范围和精神由下面的权利要求书指出。Those skilled in the art will readily appreciate other embodiments of the present application after considering the specification and practicing the invention disclosed herein. The present application is intended to cover any modification, use or adaptation of the present application, which follows the general principles of the present application and includes common knowledge or customary techniques in the art that are not disclosed in the present application. The specification and examples are intended to be exemplary only, and the true scope and spirit of the present application are indicated by the following claims.
应当理解的是,本申请并不局限于上面已经描述并在附图中示出的精确结构,并且可以在不脱离其范围进行各种修改和改变。本申请的范围仅由所附的权利要求书来限制。 It should be understood that the present application is not limited to the precise structures that have been described above and shown in the drawings, and that various modifications and changes may be made without departing from the scope thereof. The scope of the present application is limited only by the appended claims.

Claims (13)

  1. 一种知识库的构建方法,其特征在于,包括:A method for constructing a knowledge base, characterized by comprising:
    获取多条原始数据,所述多条原始数据的模态为文本、图像或视频中的至少两种;Acquire multiple pieces of original data, where the modalities of the multiple pieces of original data are at least two of text, image, or video;
    针对所述多条原始数据中的原始数据,确定所述原始数据的关键文本,并根据所述关键文本确定键值对的键,根据所述原始数据确定所述键值对的值,所述关键文本用于描述所述原始数据的内容;For original data among the plurality of original data, determine key text of the original data, determine a key of a key-value pair according to the key text, and determine a value of the key-value pair according to the original data, wherein the key text is used to describe content of the original data;
    根据所述多条原始数据对应的键值对,构建问答知识库。A question-and-answer knowledge base is constructed based on the key-value pairs corresponding to the multiple pieces of original data.
  2. 根据权利要求1所述的知识库的构建方法,其特征在于,所述确定所述原始数据的关键文本,包括:The method for constructing a knowledge base according to claim 1, wherein determining the key text of the original data comprises:
    将所述原始数据转化成自然语言文本;Converting the raw data into natural language text;
    对所述自然语言文本进行关键文本的抽取,得到所述关键文本,所述关键文本包括:所述原始数据的内容主题、核心观点、关键词、关键实体中的至少一项。Extract key text from the natural language text to obtain the key text, wherein the key text includes at least one of the content theme, core viewpoint, keywords, and key entities of the original data.
  3. 根据权利要求2所述的知识库的构建方法,其特征在于,所述对所述自然语言文本进行关键文本的抽取,得到所述关键文本,包括:The method for constructing a knowledge base according to claim 2, characterized in that extracting key text from the natural language text to obtain the key text comprises:
    将所述自然语言文本输入统一模型进行关键文本的抽取,得到所述关键文本。The natural language text is input into a unified model to extract key text to obtain the key text.
  4. 根据权利要求1至3任一项所述的知识库的构建方法,其特征在于,所述根据所述原始数据确定所述键值对的值,包括:The method for constructing a knowledge base according to any one of claims 1 to 3, characterized in that determining the value of the key-value pair according to the original data comprises:
    基于所述关键文本对所述原始数据进行切分,得到所述关键文本对应的数据片段,所述数据片段的模态与所述原始数据的模态相同;Segmenting the original data based on the key text to obtain data segments corresponding to the key text, wherein the modality of the data segments is the same as the modality of the original data;
    根据所述数据片段,确定所述键值对的值。According to the data segment, a value of the key-value pair is determined.
  5. 根据权利要求4所述的知识库的构建方法,其特征在于,所述原始数据为文本,则所述基于所述关键文本对所述原始数据进行切分,得到所述关键文本对应的数据片段,包括:The method for constructing a knowledge base according to claim 4, wherein the original data is text, and the segmenting of the original data based on the key text to obtain data segments corresponding to the key text includes:
    采用机器阅读理解技术,在所述原始数据中提取描述所述关键文本的数据片段。Machine reading comprehension technology is used to extract data segments describing the key text from the original data.
  6. 根据权利要求1至3任一项所述的知识库的构建方法,其特征在于,所述根据所述关键文本确定键值对的键,包括:The method for constructing a knowledge base according to any one of claims 1 to 3, characterized in that the step of determining the key of the key-value pair according to the key text comprises:
    对所述关键文本进行扩充,得到第一问询文本;Expanding the key text to obtain a first inquiry text;
    确定所述第一问询文本为所述键值对的键。The first query text is determined to be a key of the key-value pair.
  7. 根据权利要求6所述的知识库的构建方法,其特征在于,所述对所述关键文本进行扩充,得到问询文本之后,还包括:The method for constructing a knowledge base according to claim 6, characterized in that after the key text is expanded to obtain the query text, it also includes:
    编码所述第一问询文本,得到编码向量;Encoding the first query text to obtain an encoding vector;
    确定所述编码向量为所述键值对中的键。The encoding vector is determined to be a key in the key-value pair.
  8. 根据权利要求1至3任一项所述的知识库构的建方法,其特征在于,还包括:The method for constructing a knowledge base according to any one of claims 1 to 3, further comprising:
    对所述原始数据进行数据挖掘,得到目标数据;Performing data mining on the original data to obtain target data;
    根据所述目标数据,生成所述目标数据的第二问询文本; generating a second query text for the target data according to the target data;
    确定所述目标数据为键值对的值以及确定所述第二问询文本为所述键值对的键,以构建所述问答知识库。The target data is determined to be the value of a key-value pair and the second inquiry text is determined to be the key of the key-value pair to construct the question-answer knowledge base.
  9. 一种数据检索方法,其特征在于,包括:A data retrieval method, characterized by comprising:
    接收终端设备发送的问询数据,所述问询数据为文本、图像或视频中的一个;Receiving inquiry data sent by a terminal device, wherein the inquiry data is one of text, image or video;
    在问答知识库中,检索所述问询数据对应的答案数据,所述答案数据的模态为文本、图像或者视频中的一个,所述问答知识库是根据权利要求1至8任一项所述的知识库的构建方法构建的;In a question-and-answer knowledge base, answer data corresponding to the query data is retrieved, wherein the modality of the answer data is one of text, image or video, and the question-and-answer knowledge base is constructed according to the method for constructing a knowledge base according to any one of claims 1 to 8;
    向所述终端设备发送所述答案数据。The answer data is sent to the terminal device.
  10. 根据权利要求9所述的数据检索方法,其特征在于,所述在问答知识库中,检索问询数据对应的答案数据,包括:The data retrieval method according to claim 9, characterized in that the step of retrieving answer data corresponding to the query data in the question-answer knowledge base comprises:
    确定基于所述问询数据确定的键的键值对中的值为所述答案数据;Determine that a value in a key-value pair of a key determined based on the query data is the answer data;
    和/或,and/or,
    对所述基于所述问询数据进行编码,得到问询编码向量;Encoding the query data to obtain a query encoding vector;
    在所述问答知识库中,确定与所述编码向量的相似度大于阈值的目标编码向量;In the question-answer knowledge base, determining a target encoding vector having a similarity with the encoding vector greater than a threshold;
    在所述问答知识库中,确定所述目标编码向量为键的键值对的值为所述答案数据。In the question-answer knowledge base, it is determined that the value of the key-value pair with the target encoding vector as the key is the answer data.
  11. 一种数据检索方法,其特征在于,应用于终端设备,所述数据检索方法,包括:A data retrieval method, characterized in that it is applied to a terminal device, and the data retrieval method comprises:
    向服务器发送问询数据;Send query data to the server;
    接收服务器发送的所述问询数据的答案数据,所述答案数据是根据权利要求9或10所述的数据检索方法确定。Receive answer data of the inquiry data sent by the server, wherein the answer data is determined according to the data retrieval method according to claim 9 or 10.
  12. 一种知识库的构建装置,其特征在于,包括:A device for constructing a knowledge base, characterized by comprising:
    获取模块,用于获取多条原始数据,所述多条原始数据的模态为文本、图像或视频中的至少两种;An acquisition module, used to acquire multiple pieces of original data, wherein the modalities of the multiple pieces of original data are at least two of text, image or video;
    确定模块,用于针对所述多条原始数据中的原始数据,确定所述原始数据的关键文本,并根据所述关键文本确定键值对的键,根据所述原始数据确定所述键值对的值,所述关键文本用于描述所述原始数据的内容;a determination module, configured to determine, for original data among the plurality of original data, a key text of the original data, determine a key of a key-value pair according to the key text, and determine a value of the key-value pair according to the original data, wherein the key text is used to describe the content of the original data;
    构建模块,用于根据所述多条原始数据对应的键值对,构建问答知识库。A construction module is used to construct a question-and-answer knowledge base based on the key-value pairs corresponding to the multiple pieces of original data.
  13. 一种云设备,其特征在于,包括:处理器、存储器及存储在所述存储器上并可在处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现如权利要求1至8中任一项所述知识库的构建方法,和/或权利要求9至11任一项所述的数据检索方法。 A cloud device, characterized in that it comprises: a processor, a memory, and a computer program stored in the memory and executable on the processor, wherein when the processor executes the computer program, it implements the knowledge base construction method as described in any one of claims 1 to 8, and/or the data retrieval method as described in any one of claims 9 to 11.
PCT/CN2024/073350 2023-02-13 2024-01-19 Knowledge base construction method, data retrieval method and apparatus, and cloud device WO2024169529A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202310166877.7 2023-02-13
CN202310166877.7A CN116340479A (en) 2023-02-13 2023-02-13 Knowledge base construction method, data retrieval method, device and cloud equipment

Publications (1)

Publication Number Publication Date
WO2024169529A1 true WO2024169529A1 (en) 2024-08-22

Family

ID=86875545

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2024/073350 WO2024169529A1 (en) 2023-02-13 2024-01-19 Knowledge base construction method, data retrieval method and apparatus, and cloud device

Country Status (2)

Country Link
CN (1) CN116340479A (en)
WO (1) WO2024169529A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116340479A (en) * 2023-02-13 2023-06-27 阿里巴巴(中国)有限公司 Knowledge base construction method, data retrieval method, device and cloud equipment

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111177532A (en) * 2019-12-02 2020-05-19 平安资产管理有限责任公司 Vertical search method, device, computer system and readable storage medium
CN113077893A (en) * 2021-04-23 2021-07-06 上海理工大学 Intelligent assistive device adaptive decision making system and method
CN116340479A (en) * 2023-02-13 2023-06-27 阿里巴巴(中国)有限公司 Knowledge base construction method, data retrieval method, device and cloud equipment

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111177532A (en) * 2019-12-02 2020-05-19 平安资产管理有限责任公司 Vertical search method, device, computer system and readable storage medium
CN113077893A (en) * 2021-04-23 2021-07-06 上海理工大学 Intelligent assistive device adaptive decision making system and method
CN116340479A (en) * 2023-02-13 2023-06-27 阿里巴巴(中国)有限公司 Knowledge base construction method, data retrieval method, device and cloud equipment

Also Published As

Publication number Publication date
CN116340479A (en) 2023-06-27

Similar Documents

Publication Publication Date Title
RU2745632C1 (en) Automated response server device, terminal device, response system, response method and program
US11574145B2 (en) Cross-modal weak supervision for media classification
US20210232761A1 (en) Methods and systems for improving machine learning performance
US10341461B2 (en) System and method for automatically recreating personal media through fusion of multimodal features
CN112307762B (en) Search result sorting method and device, storage medium and electronic device
CN108304376B (en) Text vector determination method and device, storage medium and electronic device
WO2024169529A1 (en) Knowledge base construction method, data retrieval method and apparatus, and cloud device
WO2025016255A1 (en) Task processing method and automatic question answering method
CN117725220A (en) Method, server and storage medium for document characterization and document retrieval
KR20220130863A (en) Apparatus for Providing Multimedia Conversion Content Creation Service Based on Voice-Text Conversion Video Resource Matching
CN112948251B (en) Automatic software testing method and device
KR20220079026A (en) A apparatus for providing general document-based multimedia image content production service
CN112000813B (en) Knowledge base construction method and device
KR20220130860A (en) Operation method of a service providing device that converts voice information into multimedia video content
CN116756576B (en) Data processing method, model training method, electronic device and storage medium
WO2024250814A1 (en) Modal hashing network training method and apparatus, cross-modal retrieval method and apparatus, computer device, storage medium, and program product
US20230056131A1 (en) Server and method for classifying entities of a query
CN110597765A (en) Large retail call center heterogeneous data source data processing method and device
CN115718904A (en) Text processing method and device
KR20220079042A (en) Program recorded medium for providing service
KR20220079029A (en) Method for providing automatic document-based multimedia content creation service
CN116521884A (en) Object information extraction method and device, storage medium and electronic equipment
CN116756676A (en) Abstract generation method and related device
CN113535125A (en) Financial demand item generation method and device
KR102435242B1 (en) An apparatus for providing a producing service of transformed multimedia contents using matching of video resources

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 24755904

Country of ref document: EP

Kind code of ref document: A1