CN111339284A - Product intelligent matching method, device, device and readable storage medium - Google Patents
Product intelligent matching method, device, device and readable storage medium Download PDFInfo
- Publication number
- CN111339284A CN111339284A CN202010090301.3A CN202010090301A CN111339284A CN 111339284 A CN111339284 A CN 111339284A CN 202010090301 A CN202010090301 A CN 202010090301A CN 111339284 A CN111339284 A CN 111339284A
- Authority
- CN
- China
- Prior art keywords
- user
- information
- product
- preset
- matching
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 50
- 230000008451 emotion Effects 0.000 claims abstract description 80
- 230000006978 adaptation Effects 0.000 claims abstract description 36
- 230000008909 emotion recognition Effects 0.000 claims abstract description 20
- 230000004044 response Effects 0.000 claims abstract description 16
- 230000002996 emotional effect Effects 0.000 claims description 36
- 238000005516 engineering process Methods 0.000 claims description 15
- 230000014509 gene expression Effects 0.000 claims description 14
- 238000000605 extraction Methods 0.000 claims description 10
- 238000012545 processing Methods 0.000 claims description 8
- 208000028399 Critical Illness Diseases 0.000 description 22
- 238000010586 diagram Methods 0.000 description 16
- 238000004458 analytical method Methods 0.000 description 10
- 238000004891 communication Methods 0.000 description 10
- 230000008569 process Effects 0.000 description 6
- 238000003058 natural language processing Methods 0.000 description 5
- 230000009193 crawling Effects 0.000 description 4
- 238000012790 confirmation Methods 0.000 description 3
- 230000003993 interaction Effects 0.000 description 3
- 230000011218 segmentation Effects 0.000 description 3
- 230000036541 health Effects 0.000 description 2
- 229910052739 hydrogen Inorganic materials 0.000 description 2
- 101100409308 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) adv-1 gene Proteins 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000013523 data management Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000006397 emotional response Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/335—Filtering based on additional data, e.g. user or group profiles
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/06—Buying, selling or leasing transactions
- G06Q30/0601—Electronic shopping [e-shopping]
- G06Q30/0631—Item recommendations
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q40/00—Finance; Insurance; Tax strategies; Processing of corporate or income taxes
- G06Q40/08—Insurance
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Accounting & Taxation (AREA)
- Finance (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Business, Economics & Management (AREA)
- Strategic Management (AREA)
- Marketing (AREA)
- Economics (AREA)
- Development Economics (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- Technology Law (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
技术领域technical field
本发明涉及人工智能技术领域,尤其涉及一种产品智能匹配方法、装置、设备及计算机可读存储介质。The present invention relates to the technical field of artificial intelligence, and in particular, to a method, apparatus, device and computer-readable storage medium for intelligent product matching.
背景技术Background technique
目前市面上存在的所谓的智能产品手主要运用的几个渠道的保险产品的数量和类型都非常有限。与此同时,主流的智能配置平台只能做简单的信息收集和少量的产品匹配,若用户信息量增大则配置成几何级数的上升,运营无法实质展开配置。The number and types of insurance products in several channels mainly used by so-called smart products currently on the market are very limited. At the same time, the mainstream intelligent configuration platform can only do simple information collection and a small amount of product matching. If the amount of user information increases, the configuration will increase geometrically, and the operation cannot be substantially deployed.
用户信息收集方面是预设信息供用户选择,限制了用户信息获取的维度和精度。在用户信息收集的过程中不能准确的判断用户的情绪信息和用户的真实意图,容易被不真实的信息误导,从而无法确定用户的真实需求,产品和用户的匹配度不够精确。User information collection is preset information for users to choose, which limits the dimension and precision of user information acquisition. In the process of user information collection, the user's emotional information and the user's true intention cannot be accurately judged, and it is easy to be misled by untrue information, so that the user's real needs cannot be determined, and the matching degree between the product and the user is not accurate enough.
发明内容SUMMARY OF THE INVENTION
本发明的主要目的在于提供一种产品智能匹配方法、装置、设备及计算机可读存储介质,旨在解决现有用户意图及用户情绪识别准确度低,提高产品与用户的匹配准确度的技术问题。The main purpose of the present invention is to provide a product intelligent matching method, device, equipment and computer-readable storage medium, which aims to solve the technical problem of low accuracy of existing user intention and user emotion recognition and improve the matching accuracy between products and users .
为实现上述目的,本发明提供一种产品智能匹配方法,所述产品智能匹配方法包括以下步骤:In order to achieve the above object, the present invention provides a product intelligent matching method, and the product intelligent matching method includes the following steps:
输出预置问题集至用户,并获取所述用户对预置问题集中每一问题的用户答复信息;outputting a preset question set to the user, and obtaining the user's response information for each question in the preset question set;
对所述用户答复信息进行情绪识别,得到所述用户情绪信息;Perform emotion recognition on the user's reply information to obtain the user's emotion information;
对所述用户答复信息进行意图识别,得到所述用户意图信息,并基于所述用户情绪信息及所述用户意图信息,确定所述用户购买意图;Perform intention identification on the user's reply information to obtain the user's intention information, and determine the user's purchase intention based on the user's emotion information and the user's intention information;
若所述用户购买意图为正向意图,则基于所述用户信息,查询预置产品适配表,输出与所述用户信息匹配的产品。If the user's purchase intention is a positive intention, based on the user information, a preset product adaptation table is queried, and a product matching the user information is output.
可选地,所述输出预置问题集至用户,并获取所述用户对预置问题集中每一问题的用户答复信息之前,还包括:Optionally, before the outputting the preset question set to the user, and acquiring the user response information of the user to each question in the preset question set, the method further includes:
通过网络爬虫技术,采集指定网站显示的预置格式的所有产品文档信息;Collect all product document information in the preset format displayed on the specified website through web crawler technology;
对所述产品文档信息进行实体关系抽取处理,得到所述产品文档信息中各产品知识点的命名实体对象以及命名实体对象之间的实体关系;Performing entity relationship extraction processing on the product document information to obtain named entity objects of each product knowledge point in the product document information and entity relationships between the named entity objects;
根据各产品知识点的命名实体对象,采用正则表达式识别所述产品文档信息中各命名实体对象对应的实体属性;According to the named entity object of each product knowledge point, using a regular expression to identify the entity attribute corresponding to each named entity object in the product document information;
以各产品知识点对应的命名实体对象、实体属性以及实体关系为源数据,构建基于图数据库的产品知识图谱;Using the named entity objects, entity attributes and entity relationships corresponding to each product knowledge point as source data, build a product knowledge graph based on graph database;
基于所述产品知识图谱,创建所述问题集与所述产品适配表。Based on the product knowledge graph, the question set and the product adaptation table are created.
可选地,所述输出预置问题集至用户,并获取所述用户对预置问题集中每一问题的用户答复信息包括:Optionally, the outputting the preset question set to the user, and acquiring the user response information of the user to each question in the preset question set includes:
获取每一轮对话的用户答复信息;Obtain the user's reply information for each round of dialogue;
将所述用户答复信息与预置问题集中的预设槽位进行匹配;matching the user reply information with the preset slots in the preset question set;
若所述用户答复信息与所述预设槽位匹配成功,则抽取所述预设槽位;If the user reply information is successfully matched with the preset slot, extract the preset slot;
若所述用户答复信息中存在与所述预设槽位匹配的答案信息,则将所述答案信息与所述预设槽位关联;If there is answer information matching the preset slot in the user answer information, associating the answer information with the preset slot;
基于所述关联结果,收集用户信息。Based on the association result, user information is collected.
可选地,所述对所述用户答复信息进行情绪识别,得到所述用户情绪信息包括:Optionally, performing emotion recognition on the user reply information to obtain the user emotion information includes:
将所述用户答复信息分割成多个数据区段;dividing the user reply information into a plurality of data segments;
分别对所述数据区段进行情绪判断,得到各种情绪的程度属性数值;Perform emotional judgments on the data sections respectively to obtain the degree attribute values of various emotions;
基于所述各数据区段情绪的程度属性数值,分别确定所述数据区段的情绪标签;Based on the emotional degree attribute value of each data section, determine the emotional label of the data section respectively;
基于所述各数据区段的情绪标签,输出各情绪对应的情绪标签概率分布图;Based on the emotion labels of the data sections, outputting the emotion label probability distribution map corresponding to each emotion;
基于所述情绪标签概率分布图,得到所述用户情绪信息。The user emotion information is obtained based on the emotion tag probability distribution map.
可选地,所述对所述用户答复信息进行意图识别,得到所述用户意图信息,并基于所述用户情绪信息及所述用户意图信息,确定所述用户购买意图包括:Optionally, performing intention identification on the user reply information to obtain the user intention information, and determining the user purchase intention based on the user emotion information and the user intention information includes:
对所述用户答复信息以词为单位进行分析得到命名实体作为用户意图参数候选;Analyzing the user reply information in word units to obtain a named entity as a user intent parameter candidate;
对所述用户答复信息进行解析,并根据预设的用户意图关键候选集逐词模糊匹配,得到意图关键词;Analyzing the user reply information, and fuzzy matching word by word according to a preset user intent key candidate set to obtain intent keywords;
基于所述意图关键词输出用户意图识别结果;outputting a user intent identification result based on the intent keyword;
基于所述用户意图识别结果和所述用户情绪信息,确定所述用户的购买意图。The purchase intention of the user is determined based on the user intention recognition result and the user emotion information.
可选地,所述若所述用户购买意图为正向意图,则基于所述用户信息,查询预置产品适配表,输出与所述用户信息匹配的产品包括:Optionally, if the purchase intention of the user is a positive intention, query a preset product adaptation table based on the user information, and output products matching the user information include:
将所述用户信息转译为标签格式,生成所述用户信息对应的多个标签;Translate the user information into a label format, and generate a plurality of labels corresponding to the user information;
若所述用户购买意图为正向意图,则基于所述标签的预设权重值,分别将所述各标签与预置产品适配表中的产品进行匹配,计算并输出各标签与对应产品的匹配得分;If the purchase intention of the user is a positive intention, then based on the preset weight value of the tag, each tag is matched with the product in the preset product adaptation table, and the relationship between each tag and the corresponding product is calculated and output. match score;
基于所述各标签与对应产品的匹配得分,将所述各标签与对应产品的匹配得分进行求和,得到各产品的匹配程度得分;Based on the matching score of each label and the corresponding product, summing the matching score of each label and the corresponding product to obtain the matching degree score of each product;
若所述产品的匹配程度得分高于预设阈值,则将所述产品标记为推荐,若否,则标记为非推荐;If the matching degree score of the product is higher than the preset threshold, mark the product as recommended, if not, mark it as non-recommended;
将标记为推荐的预设数量个产品作为推荐结果,并输出。The preset number of products marked as recommended are used as recommended results and output.
进一步地,本发明还提供一种产品智能匹配装置,所述产品智能匹配装置包括:Further, the present invention also provides a product intelligent matching device, the product intelligent matching device includes:
获取模块,输出预置问题集至用户,并获取所述用户对预置问题集中每一问题的用户答复信息;an acquisition module, outputting a preset question set to the user, and acquiring the user's response information for each question in the preset question set;
情绪识别模块,用于对所述用户答复信息进行情绪识别,得到当前用户情绪信息;an emotion recognition module, configured to perform emotion recognition on the user response information to obtain current user emotion information;
意图识别模块,对所述用户答复信息进行意图识别,得到所述用户意图信息,并基于所述用户情绪信息及所述用户意图信息,确定所述用户购买意图;an intention recognition module, which performs intention recognition on the user's reply information to obtain the user's intention information, and determines the user's purchase intention based on the user's emotion information and the user's intention information;
产品匹配模块,用于当用户的意图为正向意图时,基于所述用户信息,查询预置产品适配表,输出与所述用户信息匹配的产品。The product matching module is used for querying a preset product adaptation table based on the user information when the user's intention is a positive intention, and outputting a product matching the user information.
可选地,所述产品智能匹配装置还包括:Optionally, the product intelligent matching device further includes:
采集模块,用于通过网络爬虫技术,采集指定网站显示的预置格式的所有产品文档信息;The collection module is used to collect all the product document information in the preset format displayed on the designated website through the web crawler technology;
构建模块,用于对所述产品文档信息进行实体关系抽取处理,得到所述产品文档信息中各产品知识点的命名实体对象以及命名实体对象之间的实体关系;根据各产品知识点的命名实体对象,采用正则表达式识别所述产品文档信息中各命名实体对象对应的实体属性;以各产品知识点对应的命名实体对象、实体属性以及实体关系为源数据,构建基于图数据库的产品知识图谱;The building module is used to perform entity relationship extraction processing on the product document information, and obtain the named entity objects of each product knowledge point in the product document information and the entity relationship between the named entity objects; according to the named entity objects of each product knowledge point object, using regular expressions to identify the entity attributes corresponding to each named entity object in the product document information; using the named entity objects, entity attributes and entity relationships corresponding to each product knowledge point as source data to build a graph database-based product knowledge map ;
创建模块,用于基于所述产品知识图谱,创建所述问题集与所述产品适配表。A creation module is configured to create the question set and the product adaptation table based on the product knowledge graph.
可选地,所述获取模块具体用于:Optionally, the obtaining module is specifically used for:
获取每一轮对话的用户答复信息,将所述用户答复信息与预置问题集中的预设槽位进行匹配;当所述用户答复信息与所述预设槽位匹配成功时,则抽取所述预设槽位;当所述用户答复信息中存在与所述预设槽位匹配的答案信息时,则将所述答案信息与所述预设槽位关联,基于所述关联结果,收集用户信息。Obtain the user reply information of each round of dialogue, and match the user reply information with the preset slot in the preset question set; when the user reply information is successfully matched with the preset slot, extract the Preset slot; when there is answer information matching the preset slot in the user reply information, then associate the answer information with the preset slot, and collect user information based on the association result .
可选地,所述情绪识别模块具体用于:Optionally, the emotion recognition module is specifically used for:
将所述用户答复信息分割成多个数据区段,分别对所述数据区段进行情绪判断,得到各种情绪的程度属性数值;基于所述各数据区段情绪的程度属性数值,分别确定所述数据区段的情绪标签,输出各情绪对应的情绪标签概率分布图,得到所述用户情绪。The user reply information is divided into a plurality of data sections, and emotional judgments are performed on the data sections to obtain the degree attribute values of various emotions; based on the emotional degree attribute values of the various data sections, determine the The emotion label of the data section is output, and the probability distribution map of the emotion label corresponding to each emotion is output to obtain the user emotion.
可选地,所述意图识别模块具体用于:Optionally, the intention recognition module is specifically used for:
对所述用户答复信息以词为单位进行分析得到命名实体作为用户意图参数候选,对所述用户答复信息进行解析,并根据预设的用户意图关键候选集逐词模糊匹配,得到意图关键词,输出用户意图识别结果,基于所述用户意图识别结果和所述用户情绪,确定所述用户的购买意图。Analyzing the user reply information in units of words to obtain a named entity as a user intent parameter candidate, analyzing the user reply information, and word-by-word fuzzy matching according to a preset user intent key candidate set to obtain intent keywords, A user intent identification result is output, and the user's purchase intent is determined based on the user intent identification result and the user emotion.
可选地,所述产品推荐模块具体用于:Optionally, the product recommendation module is specifically used for:
将所述用户信息转译为标签格式,生成所述用户信息对应的多个标签,当所述用户购买意图为正向意图时,则基于所述标签的预设权重值,分别将所述各标签与预置产品适配表中的产品进行匹配,计算并输出各标签与对应产品的匹配得分,将所述各标签与对应产品的匹配得分进行求和,得到各产品的匹配程度得分,当所述产品的匹配程度得分高于预设阈值时,则将所述产品标记为推荐,若否,则标记为非推荐;将标记为推荐的预设数量个产品作为推荐结果并输出。Translate the user information into a tag format, and generate multiple tags corresponding to the user information. When the user's purchase intention is a positive intention, then based on the preset weight value of the tag, the tags are respectively Match with the products in the preset product adaptation table, calculate and output the matching scores of each label and the corresponding product, and sum up the matching scores of each label and the corresponding product to obtain the matching degree score of each product. When the matching degree score of the product is higher than the preset threshold, the product is marked as recommended, if not, it is marked as non-recommended; a preset number of products marked as recommended are used as recommendation results and output.
进一步地,为实现上述目的,本发明还提供一种产品智能匹配设备,所述产品智能匹配设备包括存储器、处理器以及存储在所述存储器上并可在所述处理器上运行的产品智能匹配程序,所述产品智能匹配程序被所述处理器执行时实现如上述任一项所述的产品智能匹配方法的步骤。Further, in order to achieve the above object, the present invention also provides a product intelligent matching device, the product intelligent matching device includes a memory, a processor, and a product intelligent matching device stored on the memory and running on the processor. A program, when the intelligent product matching program is executed by the processor, implements the steps of any one of the above-mentioned methods for intelligent product matching.
进一步地,为实现上述目的,本发明还提供一种计算机可读存储介质,所述计算机可读存储介质上存储有产品智能匹配程序,所述产品智能匹配程序被处理器执行时实现如上述任一项所述的产品智能匹配方法的步骤。Further, in order to achieve the above object, the present invention also provides a computer-readable storage medium on which a product intelligent matching program is stored, and when the product intelligent matching program is executed by a processor, any of the above-mentioned items are realized. The steps of a described product intelligent matching method.
本发明提出的产品智能匹配方法,完善了目前市面上存在的所谓的智能产品助手业务模式和技术方面的缺陷,提供了以用户信息和用户需求为基准的产品智能匹配方法。通过网络爬虫技术采集指定网站的产品文档信息,并对产品文档信息进行加工整理,构建针对产品知识点的问题集,以供进行多轮对话收集用户信息从而避免简单的正向收集用户信息时可能造成的信息缺失或对保险理解不一致。同时,基于多个维度收集用户个性化信息,通过预设模型的知识图谱问答解决用户需求和产品匹配的问题。通过用户&产品适配模型匹配后,能进行精准的产品推荐。The intelligent product matching method proposed by the present invention improves the current market so-called intelligent product assistant business model and technical defects, and provides a product intelligent matching method based on user information and user needs. Collect the product document information of the designated website through the web crawler technology, process the product document information, and build a problem set for product knowledge points for multiple rounds of dialogue to collect user information, so as to avoid the possibility of simple positive collection of user information. The resulting lack of information or inconsistent understanding of insurance. At the same time, it collects user personalized information based on multiple dimensions, and solves the problem of user needs and product matching through the knowledge graph question and answer of the preset model. After matching through the user & product adaptation model, accurate product recommendations can be made.
附图说明Description of drawings
图1为本发明产品智能匹配设备实施例方案涉及的设备硬件运行环境的结构示意图;1 is a schematic structural diagram of a device hardware operating environment involved in an embodiment of an intelligent product matching device solution according to the present invention;
图2为本发明产品智能匹配方法一实施例的流程示意图;2 is a schematic flowchart of an embodiment of an intelligent product matching method according to the present invention;
图3为图2中步骤S110一实施例的细化流程示意图;FIG. 3 is a schematic diagram of a refinement flow of an embodiment of step S110 in FIG. 2;
图4为本发明产品智能匹配方法第二实施例的流程示意图;4 is a schematic flowchart of a second embodiment of the intelligent product matching method of the present invention;
图5为图2中步骤S120一实施例的细化流程示意图;FIG. 5 is a schematic diagram of a refinement flow of an embodiment of step S120 in FIG. 2;
图6为图2中步骤S130一实施例的细化流程示意图;FIG. 6 is a schematic diagram of a refinement flow of an embodiment of step S130 in FIG. 2;
图7为图2中步骤S140一实施例的细化流程示意图;FIG. 7 is a schematic diagram of a refinement flow of an embodiment of step S140 in FIG. 2;
图8为本发明产品智能匹配装置一实施例的功能模块示意图。FIG. 8 is a schematic diagram of functional modules of an embodiment of an intelligent product matching device according to the present invention.
本发明目的的实现、功能特点及优点将结合实施例,参照附图做进一步说明。The realization, functional characteristics and advantages of the present invention will be further described with reference to the accompanying drawings in conjunction with the embodiments.
具体实施方式Detailed ways
应当理解,此处所描述的具体实施例仅用以解释本发明,并不用于限定本发明。It should be understood that the specific embodiments described herein are only used to explain the present invention, but not to limit the present invention.
本发明提供一种产品智能匹配设备。The invention provides an intelligent product matching device.
参照图1,图1为本发明产品智能匹配设备实施例方案涉及的设备硬件运行环境的结构示意图。Referring to FIG. 1 , FIG. 1 is a schematic structural diagram of a device hardware operating environment involved in an embodiment of an intelligent product matching device according to the present invention.
如图1所示,该产品智能匹配设备可以包括:处理器1001,例如CPU,通信总线1002、用户接口1003,网络接口1004,存储器1005。其中,通信总线1002用于实现这些组件之间的连接通信。用户接口1003可以包括显示屏(Display)、输入单元比如键盘(Keyboard),可选用户接口1003还可以包括标准的有线接口、无线接口。网络接口1004可选的可以包括标准的有线接口、无线接口(如WI-FI接口)。存储器1005可以是高速RAM存储器,也可以是稳定的存储器(non-volatile memory),例如磁盘存储器。存储器1005可选的还可以是独立于前述处理器1001的存储设备。As shown in FIG. 1 , the product intelligent matching device may include: a
本领域技术人员可以理解,图1中示出的产品智能匹配设备的硬件结构并不构成对产品智能匹配设备的限定,可以包括比图示更多或更少的部件,或者组合某些部件,或者不同的部件布置。Those skilled in the art can understand that the hardware structure of the intelligent product matching device shown in FIG. Or a different component arrangement.
如图1所示,作为一种计算机可读存储介质的存储器1005中可以包括操作系统、网络通信模块、用户接口模块以及产品智能匹配程序。其中,操作系统是管理和控制产品智能匹配设备与软件资源的程序,支持网络通信模块、用户接口模块、产品智能匹配程序以及其他程序或软件的运行;网络通信模块用于管理和控制网络接口1004;用户接口模块用于管理和控制用户接口1003。As shown in FIG. 1 , the
在图1所示的产品智能匹配设备硬件结构中,网络接口1004主要用于连接系统后台,与系统后台进行数据通信;用户接口1003主要用于连接客户端(用户端),与客户端进行数据通信;产品智能匹配设备通过处理器1001调用存储器1005中存储的产品智能匹配程序,并执行以下产品智能匹配方法的各实施例的操作。In the hardware structure of the product intelligent matching device shown in FIG. 1 , the
基于上述产品智能匹配设备硬件结构,提出本发明产品智能匹配方法的各个实施例。Based on the above-mentioned hardware structure of the product intelligent matching device, various embodiments of the product intelligent matching method of the present invention are proposed.
参照图2,图2为本发明产品智能匹配方法一实施例的流程示意图。本实施例中,所述产品智能匹配方法包括以下步骤:Referring to FIG. 2 , FIG. 2 is a schematic flowchart of an embodiment of an intelligent product matching method of the present invention. In this embodiment, the intelligent product matching method includes the following steps:
步骤S110,输出预置问题集至用户,并获取所述用户对预置问题集中每一问题的用户答复信息;Step S110, outputting the preset question set to the user, and acquiring the user response information of the user to each question in the preset question set;
本实施例中,问题集是通过阅读理解模型对保险产品文档中保险知识的解析提取,自动生成的针对保险文档各知识点的问答库,能支持多轮对话收集用户信息。进一步地避免了简单的正向收集用户信息时可能造成的信息缺失或对保险理解不一致。基于多轮对话中每一轮对话所收集到的用户答复信息,多维度收集用户信息。In this embodiment, the question set is an automatically generated question and answer library for each knowledge point of the insurance document through the analysis and extraction of the insurance knowledge in the insurance product document through the reading comprehension model, which can support multiple rounds of dialogues to collect user information. It further avoids the lack of information or inconsistent understanding of insurance that may be caused by simply collecting user information in a forward direction. Based on the user response information collected in each round of the multi-round dialogue, the user information is collected in multiple dimensions.
本实施例中,用户信息是指用户个人信息、家庭关系信息、家庭资产状况、风险配好信息以及保险用途信息等。根据多维度用户信息,为用户提供更精准的保险产品配置。In this embodiment, the user information refers to the user's personal information, family relationship information, family asset status, risk allocation information, insurance usage information, and the like. Provide users with more accurate insurance product configuration based on multi-dimensional user information.
步骤S120,对所述用户答复信息进行情绪识别,得到所述用户情绪信息;Step S120, performing emotion recognition on the user reply information to obtain the user emotion information;
本实施例中,用户答复信息是指在多轮对话过程中,根据预置问题集进行对话收集到的用户的答复信息。对用户答复信息进行情绪识别,提取用户答复信息的音频特征或语义特征等,根据所述情绪识别结果中,确定用户情绪信息。In this embodiment, the user's reply information refers to the user's reply information collected by conducting a dialogue according to a preset set of questions during multiple rounds of dialogue. Perform emotion recognition on the user's reply information, extract audio features or semantic features of the user's reply information, etc., and determine the user's emotion information according to the emotion recognition result.
本实施例中,用户情绪信息是指多轮对话时,用户回答问题时候的情绪信息,若用户回答情绪消极,则要在多轮对话中,对目标问题进行委婉的二次确认,从而获取用户的真实需求。比如说,多轮对话中,系统提问“您是否想了解重疾险”,若用户回答“是的,请具体介绍一下重疾险的详细内容”,则判定用户的情绪信息是积极的,想要了解“重疾险”,同时也说明用户对重疾险有一定的购买需求。若用户回答的是“还行吧”或者“可以了解一下”,则判定用户对重疾险并不感兴趣,情绪信息是消极的。In this embodiment, the user's emotional information refers to the emotional information when the user answers the question during multiple rounds of dialogue. If the user's answer is negative, the target question needs to be euphemistically confirmed twice in the multiple rounds of dialogue, so as to obtain the user's emotional response. real needs. For example, in multiple rounds of conversations, the system asks "Do you want to know about critical illness insurance?" If the user answers "Yes, please introduce the details of critical illness insurance", it is determined that the user's emotional information is positive. To understand "critical illness insurance", it also shows that users have certain purchase needs for critical illness insurance. If the user answers "Is it okay" or "I can learn about it", it is determined that the user is not interested in critical illness insurance, and the emotional information is negative.
步骤S130,对所述用户答复信息进行意图识别,得到所述用户意图信息,并基于所述用户情绪信息及所述用户意图信息,确定所述用户购买意图;Step S130, performing intention identification on the user reply information to obtain the user intention information, and determining the user's purchase intention based on the user emotion information and the user intention information;
本实施例中,对用户答复信息进行意图识别是指,对用户答复信息进行语义识别和文本解析,根据语义识别和文本分析结果,判断用户的基本意图信息。然后再根据情绪识别结果,确定用户的购买意图。比如说,系统提问“您是否熟悉我公司少儿险的具体条款内容”,用户的答复信息为“不太了解,可以详细介绍一下少儿险吗”对用户的答复信息进行语义分析和文本解析,判断出用户意图信息为:“用户不了解少儿险”“用户希望具体了解一下少儿险的具体条款内容”,同时也可以判断出用户的态度是积极的,根据用户的意图信息和用户的情绪信息,进一步的判断出用户的购买意图:用户有购买少儿险的意图。In this embodiment, performing intention recognition on the user's reply information refers to performing semantic recognition and text analysis on the user's reply information, and judging the basic intention information of the user according to the results of the semantic recognition and text analysis. Then, according to the emotion recognition results, the purchase intention of the user is determined. For example, the system asks "Are you familiar with the specific terms and conditions of our company's children's insurance", and the user's reply information is "I don't know much, can you introduce the children's insurance in detail?" Semantic analysis and text analysis are performed on the user's response information to determine The user intention information is: "The user does not understand children's insurance", "The user wants to know the specific terms of the children's insurance", and it can also be judged that the user's attitude is positive. According to the user's intention information and the user's emotional information, Further determine the user's purchase intention: the user has the intention to purchase children's insurance.
步骤S140,若所述用户购买意图为正向意图,则基于所述用户信息,查询预置产品适配表,输出与所述用户信息匹配的产品。Step S140, if the user's purchase intention is a positive intention, query a preset product adaptation table based on the user information, and output a product matching the user information.
本实施例中,正向意图也可以说是真实购买意图。若用户的购买意图为正向的,则说明用户对产品有兴趣,有购买意愿和需求。根据收集到的用户信息,查询预置的产品适配表,输出与用户的购买需求相匹配的产品。In this embodiment, the positive intention can also be said to be the real purchase intention. If the user's purchase intention is positive, it means that the user is interested in the product and has purchase intention and demand. According to the collected user information, query the preset product adaptation table, and output products that match the user's purchasing needs.
本实施例中,产品适配表是指根据用户年龄、工作性质、收入状况(当前及预期)、家庭状况、风险偏好、理财能力等多维度信息设计的适合该用户的保险产品的集合。比如说,根据用户的信息来看,用户可以买产品的适配表为A、C、S、H、K,但是用户的需求是买产品A、D、H,根据用户的正向购买意图和用户信息,查询产品适配表,输出匹配的产品。In this embodiment, the product adaptation table refers to a collection of insurance products suitable for the user designed according to multi-dimensional information such as the user's age, work nature, income status (current and expected), family status, risk preference, and financial ability. For example, according to the user's information, the user can buy products with the adaptation table A, C, S, H, K, but the user's demand is to buy products A, D, H, according to the user's positive purchase intention and User information, query product adaptation table, and output matching products.
本实施例中,通过由保险文档知识点生成的问题集与用户进行多轮对话,基于多个维度收集更精准的用户信息,避免了预设信息供用户选择,而造成的用户信息获取维度和精准度受限的问题。同时在多轮对话收集用户信息过程中增加使用情绪识别能力,对用户答复信息中的所携带的情绪进行识别,判断用户的情绪。若用户的情绪为积极的,则可以确定该答复为用户的真实意图;若用户的情绪比较消极,则进行委婉的二次确认,以确认用户的真实意图提升了对用户真实意图判断的精度,从而获取用户的真实需求,避免被不真实的信息误导。In this embodiment, multiple rounds of dialogue are conducted with the user through the question set generated by the insurance document knowledge points, and more accurate user information is collected based on multiple dimensions, which avoids preset information for the user to choose, and the user information acquisition dimension and The problem of limited accuracy. At the same time, in the process of collecting user information in multiple rounds of dialogue, the ability to use emotion recognition is increased, and the emotion carried in the user's reply information is recognized, and the user's emotion is judged. If the user's emotion is positive, it can be determined that the answer is the user's true intention; if the user's emotion is relatively negative, a euphemistic secondary confirmation is performed to confirm the user's true intention and improve the accuracy of the user's true intention judgment. So as to obtain the real needs of users and avoid being misled by untrue information.
进一步地,根据用户的答复信息对用户的意图进行识别,得到用户意图识别结果。将用户意图和用户情绪两者结合在一起,根据用户信息,查询产品适配表,输出比较适合用户的产品。其中,产品适配表是依据用户年龄、工作性质、收入状况(当前及预期)、家庭状况、风险偏好、理财能力等多维度信息设计的比较适合用户购买的保险产品的集合,通过适配规则,根据用户的主观需求配置适合的产品推荐给用户。Further, the user's intention is identified according to the user's reply information, and the user's intention identification result is obtained. Combining user intentions and user emotions, querying the product adaptation table based on user information, and outputting products that are more suitable for users. Among them, the product adaptation table is a collection of insurance products that are more suitable for users to purchase based on multi-dimensional information such as the user's age, work nature, income status (current and expected), family status, risk preference, and financial ability. , according to the subjective needs of users, configure suitable products and recommend them to users.
参照图3,图3为图2中步骤S110一实施例的细化流程示意图。本实施例中,上述步骤S110包括:Referring to FIG. 3 , FIG. 3 is a schematic diagram of a refinement flow of an embodiment of step S110 in FIG. 2 . In this embodiment, the above step S110 includes:
步骤S1101,获取每一轮对话的用户答复信息;Step S1101, obtaining user reply information for each round of dialogue;
本实施例中,多轮对话适用于封闭场景,是一种在人机对话中,初步明确用户意图之后,获取必要信息以最终得到明确用户指令的方式。根据多轮对话收集到的用户答复信息,其本身所包含的信息也只占总传递信息量的一小部分,更多信息来源于说话人的身份、当前的时间/地点等一系列场景信息。所以多轮对话的信息获取方式也不应当只局限于用户所说的话。In this embodiment, the multi-round dialogue is suitable for closed scenarios, which is a method of obtaining necessary information to finally obtain a clear user instruction after initially clarifying the user's intention in the human-machine dialogue. According to the user's reply information collected in multiple rounds of dialogue, the information contained in it itself only accounts for a small part of the total amount of information transmitted, and more information comes from a series of scene information such as the speaker's identity and current time/place. Therefore, the information acquisition method of multi-round dialogue should not be limited to what the user said.
多轮对话在形式上并不一定表现为与用户的多次对话交互,比如说,如果用户的话语中已经提供了充足的信息,或者其它来源的补充信息已经足够将用户的初步意图转化为一条明确的用户指令,那就不会存在与用户的多次对话交互。根据多轮对话,获取用户答复信息。Multi-turn dialogues do not necessarily represent multiple dialogue interactions with the user in form, for example, if sufficient information has been provided in the user's utterance, or supplementary information from other sources is sufficient to convert the user's initial intention into a sentence. With explicit user instructions, there will be no multiple conversational interactions with the user. Obtain user response information according to multiple rounds of dialogue.
本实施例中,用户答复信息可以是用户的个人基本信息,也可以是用户的风险配置偏好信息等。In this embodiment, the user reply information may be the user's personal basic information, or may be the user's risk configuration preference information or the like.
步骤S1102,将所述用户答复信息与预置问题集中的预设槽位进行匹配;Step S1102, matching the user reply information with a preset slot in a preset question set;
本实施例中,将获取的用户答复信息与预置问题集中的预设槽位进行匹配。In this embodiment, the acquired user reply information is matched with the preset slots in the preset question set.
本实施例中,槽位是在多轮对话过程中将初步用户意图转化为明确用户指令所需要补全的信息。一个槽位与一件事情的处理中所需要获取的一种信息相对应。其中,需要说明的是,并非所有的槽位都要被匹配成功并关联,以以下对话为例:我:【去萧山机场多少钱】,出租车司机:【70】;对话中的【70】,应当被理解为70元人民币,而不必再去追问:【你说的是人民币、美元、日元还是港币?】。这类信息应当以默认值的形式存在,也即槽有必关联与非必关联之分,与上文所说的【信息未必需要通过与用户的对话获取】相对应。In this embodiment, the slot is the information that needs to be completed to convert the preliminary user intention into a clear user instruction in the process of multiple rounds of dialogue. A slot corresponds to a kind of information that needs to be obtained in the processing of one thing. Among them, it should be noted that not all slots must be successfully matched and associated. Take the following dialogue as an example: me: [how much does it cost to go to Xiaoshan Airport], taxi driver: [70]; [70] in the dialogue , should be understood as RMB 70, and there is no need to ask: [Are you talking about RMB, US dollar, Japanese yen or Hong Kong dollar? ]. This type of information should exist in the form of a default value, that is, the slot has a necessary association and a non-essential association, which corresponds to the above-mentioned [information does not necessarily need to be obtained through a dialogue with the user].
步骤S1103,若所述用户答复信息与所述预设槽位匹配成功,则抽取所述预设槽位;Step S1103, if the user reply information is successfully matched with the preset slot, extract the preset slot;
本实施例中,槽位分为词槽与接口槽两种槽位类型,若用户答复信息与预设槽位匹配成功,则提取该匹配成功的槽位,比如说,用户表达“不了”,“不行”,“不是”,“没有”等一系列说法均可以与槽位【不】匹配成功,则提取该槽位。In this embodiment, the slots are divided into two slot types: word slots and interface slots. If the user's reply information matches the preset slot successfully, the successfully matched slot is extracted. For example, if the user expresses "no", A series of statements such as "No", "No", "No" can be successfully matched with the slot [No], then the slot will be extracted.
步骤S1104,若所述用户答复信息中存在与所述预设槽位匹配的答案信息,则将所述答案信息与所述预设槽位关联,并输出关联结果;Step S1104, if there is answer information matching the preset slot in the user reply information, then associate the answer information with the preset slot, and output an association result;
本实施例中,若用户答复信息中存在与该匹配成功的槽位匹配的答案信息,则将相关的答案信息填入该槽位中,并输出。这里需要分成两种情况来讨论,一种是:我们明确知道相关的答案信息,可以直接填入槽位中,不需要向用户确认。比如说,在多轮对话中,系统问到用户“请问您家庭收入怎么样呢?”用户回答“我一年收入大概10万”,那就会与“年收入”这个槽位匹配成功。另一种是:我们知道相关的答案信息只能作为参考,需要用户的协助才能进行槽位的填写,比如说用户表达“不了”,“不行”,“不是”,“没有”等一系列说法均可以与槽位【不】匹配成功,这种情况下,需要提供选项,让用户最终决定该槽位的填入值。In this embodiment, if there is answer information matching the successfully matched slot in the user's reply information, the relevant answer information is filled in the slot and output. There are two situations to discuss here. One is: We clearly know the relevant answer information and can directly fill in the slot without confirming to the user. For example, in multiple rounds of dialogue, the system asks the user "How is your household income?" The user answers "My annual income is about 100,000", then it will be successfully matched with the "Annual Income" slot. The other is: we know that the relevant answer information can only be used as a reference, and the user's assistance can be used to fill in the slot. For example, the user expresses a series of statements such as "no", "no", "no", "no" and so on. Both can be successfully matched with the slot [Not]. In this case, an option needs to be provided so that the user can finally decide the filling value of the slot.
步骤S1105,基于所述关联结果,收集用户信息。Step S1105: Collect user information based on the association result.
本实施例中,若用户答复信息与问题集中的预设槽位匹配成功,则称用户答复信息与槽位有关联。比如说,在多轮对话中,系统问到用户“请问您家庭收入怎么样呢?”用户回答“我一年收入大概10万”,那就会与“年收入”这个槽位匹配成功。将用户回答中的“10万”与这个槽位相关联,并输出关联结果。其中的关联结果就是“年收入10万”,根据关联结果收集用户信息,用户的年收入是10万元。In this embodiment, if the user's reply information is successfully matched with the preset slot in the question set, it is said that the user's reply information is associated with the slot. For example, in multiple rounds of dialogue, the system asks the user "How is your household income?" The user answers "My annual income is about 100,000", then it will be successfully matched with the "Annual Income" slot. Associate "100,000" in the user's answer with this slot, and output the associated result. The association result is "annual income of 100,000 yuan", and user information is collected according to the association result, and the user's annual income is 100,000 yuan.
参照图4,图4为本发明产品智能匹配方法第二实施例的流程示意图。本实施例中,上述步骤S110之前,还包括:Referring to FIG. 4 , FIG. 4 is a schematic flowchart of the second embodiment of the intelligent product matching method of the present invention. In this embodiment, before the above step S110, the method further includes:
步骤S210,通过网络爬虫技术,采集指定网站显示的预置格式的所有产品文档信息;Step S210, collecting all product document information in a preset format displayed on a designated website through a web crawler technology;
本实施例中,通过网络爬虫技术可以将互联网中数以百亿计的网页信息保存到本地。具体为通过爬虫代码程序模拟浏览器向网络服务器发送请求,以便将网络资源从网络流中读取出来并保存到本地,此外,还进一步基于相关信息提取规则,从爬取的信息中提取用户需要的信息。In this embodiment, tens of billions of web page information in the Internet can be saved locally by means of the web crawler technology. Specifically, the crawler code program simulates the browser to send a request to the network server, so as to read the network resources from the network stream and save it locally. In addition, based on the relevant information extraction rules, the user needs are extracted from the crawled information. Information.
本实施例中,基于爬取的网站类型的不同,因此采集的产品文档信息内容亦不相同。比如,从某一网站中爬取数据,则采集的内容为产险相关的信息,而如果是从某一科技公司官方网站中爬取数据,则采集的内容为科技知识,比如大数据管理,数据精算,AI等。基于爬取的内容的不同,因此构建的产品知识图谱亦不相同。In this embodiment, based on the different types of websites to be crawled, the content of the collected product document information is also different. For example, if data is crawled from a certain website, the content collected is property insurance related information, while if data is crawled from the official website of a technology company, the content collected is scientific and technological knowledge, such as big data management, Data actuarial, AI, etc. Based on the different crawled content, the product knowledge graphs constructed are also different.
本实施例中,对于爬取方式不限。优选通过Docker容器作为媒介部署指定的爬虫程序,以爬取指定的网站内容。例如爬取保险公司网站中的保险产品信息及对应产品的价位、对应购买人群等内容。爬取的网站包括指定的公示网站以及通过搜索引擎搜索到的网站。In this embodiment, the crawling method is not limited. It is preferable to deploy a specified crawler program through a Docker container as a medium to crawl the specified website content. For example, crawling insurance product information on the website of an insurance company, the price of the corresponding product, and the corresponding purchasers. The crawled websites include designated public websites and websites searched by search engines.
本实施例中,通过网络爬虫技术从市面上所有保险公司的官网爬取最新的保险产品文档入库,并添加保险状态信息。爬虫每周爬取一次,以更新保险产品的状态,以此获取全网的保险产品文档信息。In this embodiment, the latest insurance product documents are crawled from the official websites of all insurance companies in the market through the web crawler technology, and the insurance status information is added. The crawler crawls once a week to update the status of insurance products, so as to obtain the document information of insurance products on the whole network.
本实施例中,保险状态信息是指是保险产品的销售状态信息,包括新上线、在售、停售信息,在此基础上增加保险的销售地域信息和保险公司在当地的分支机构信息。通过销售状态和销售地域,才能确定给用户推荐的保险产品集,否则部分产品已停售、或地区不支持,该类保险产品再好也不适合推荐。In this embodiment, the insurance status information refers to the sales status information of the insurance product, including the information of newly launched, on sale, and discontinued sale. On this basis, the information of the insurance sales area and the local branch of the insurance company is added. The set of insurance products recommended to users can be determined only by the sales status and sales region. Otherwise, some products have been discontinued or the region does not support them, and this type of insurance product is not suitable for recommendation no matter how good it is.
步骤S220,对所述产品数据进行实体关系抽取处理,得到所述产品文档信息中各产品知识点的命名实体对象以及命名实体对象之间的实体关系;Step S220, performing entity relationship extraction processing on the product data to obtain named entity objects of each product knowledge point in the product document information and entity relationships between named entity objects;
本实施例中,为便于根据产品知识点生成针对保险文档各知识点的问题集,支持多轮对话反向确认用户信息,因此需要预先获得产品知识点以及各产品知识点之间的实体关系,本实施例中具体采用自然语言处理技术进行实体关系抽取处理,也即抽取出产品文档信息中各产品知识点的命名实体对象以及命名实体对象之间的实体关系。In this embodiment, in order to facilitate the generation of a question set for each knowledge point of the insurance document according to the product knowledge points, and to support the reverse confirmation of user information in multiple rounds of dialogue, it is necessary to obtain the product knowledge points and the entity relationship between the various product knowledge points in advance. In this embodiment, the natural language processing technology is used to extract the entity relationship, that is, the named entity object of each product knowledge point in the product document information and the entity relationship between the named entity objects are extracted.
自然语言处理技术(Natural Language Processing,NLP)的主要目的在于帮助机器更好地理解人的语言,包括基础的词法、句法等语义理解,以及需求、情感等高层面的理解,进而弥补人类交流(自然语言)和计算机理解(机器语言)之间的差距。The main purpose of Natural Language Processing (NLP) technology is to help machines better understand human language, including basic lexical, syntactic and other semantic understanding, as well as high-level understanding such as needs and emotions, and then make up for human communication ( The gap between natural language) and computer understanding (machine language).
本实施例中,在爬取到网站中预先指定的相关内容后,需要通过自然语言处理技术从爬取的内容中抽取产品知识点,例如,产品价格信息、产品条款信息、产品常用词和常用问答以及相关引用的产品文档内容等。In this embodiment, after crawling the relevant content pre-specified in the website, it is necessary to extract product knowledge points from the crawled content through natural language processing technology, for example, product price information, product term information, product common words and commonly used products Questions and answers and related referenced product documentation content, etc.
本实施例通过基于自然语言处理的知识抽取技术,获取对应的保险产品详情、保险产品专有名词以及常用保险产品概念等知识点。具体基于保险产品文档信息中的产品名称、状态信息以及赔付范围从爬取的网站内容中进行知识点抽取。In this embodiment, knowledge points such as corresponding insurance product details, insurance product proper nouns, and commonly used insurance product concepts are acquired through the knowledge extraction technology based on natural language processing. Specifically, knowledge points are extracted from the crawled website content based on the product name, status information and compensation scope in the insurance product document information.
步骤S230,根据各产品知识点的命名实体对象,采用正则表达式识别所述产品文档信息中各命名实体对象对应的实体属性;Step S230, according to the named entity object of each product knowledge point, using a regular expression to identify the entity attribute corresponding to each named entity object in the product document information;
本实施例中,本实施例中,为构建产品知识结构图,在抽取出产品文档信息中各产品知识点的命名实体对象后,进一步采用正则表达式识别出产品文档中各命名实体对象对应的实体属性。In this embodiment, in this embodiment, in order to construct a product knowledge structure diagram, after extracting the named entity objects of each product knowledge point in the product document information, a regular expression is further used to identify the corresponding named entity objects in the product document. Entity properties.
正则表达式描述了一种字符串匹配的模式,可以用来检查一个串是否含有某种子串、将匹配的子串替换或者从某个串中取出符合某个条件的子串等。它的设计思想是用一种描述性的语言来给字符串定义一个规则,凡是符合规则的字符串,则认为该字符串与正则表达式“匹配”。Regular expressions describe a pattern of string matching, which can be used to check whether a string contains a certain substring, replace a matched substring, or extract a substring that meets a certain condition from a string, etc. Its design idea is to use a descriptive language to define a rule for a string, and any string that conforms to the rule is considered to "match" the regular expression.
本实施例中预先编写出可用于识别产品文档信息中命名实体对象的实体属性的正则表达式模式,一个正则表达式模式可以是由简单的字符构成,也可以是由多种字符、不同方法组合而成。In this embodiment, a regular expression pattern that can be used to identify entity attributes of named entity objects in product document information is pre-written. A regular expression pattern can be composed of simple characters, or can be composed of multiple characters and different methods. made.
例如,保险产品信息中中通常都有如下表达方式:XX保险,是以XX人作为被保险人的保险,或者是XX保险险种之一,分为XX健康保险、教育保险等,则可设置与上述表达方式相匹配的正则表达式模式,进而识别出保险产品信息中中具体属性内容的表达方式。For example, insurance product information usually has the following expressions: XX insurance, insurance with XX people as the insured, or one of XX insurance types, divided into XX health insurance, education insurance, etc., you can set and The regular expression pattern that matches the above expressions, and then identifies the expression method of the specific attribute content in the insurance product information.
步骤S240,以各产品知识点对应的命名实体对象、实体属性以及实体关系为源数据,构建基于图数据库的产品知识图谱;Step S240, using the named entity object, entity attribute and entity relationship corresponding to each product knowledge point as source data to construct a product knowledge graph based on a graph database;
本实施例中,以各产品知识点对应的命名实体对象、实体属性以及实体关系为源数据,构建基于图数据库的产品知识结构图,比如构建保险产品知识图谱。同时,基于抽取到的保险产品知识点类型的不同,对应构建不同的保险产品知识结构图。In this embodiment, the named entity object, entity attribute and entity relationship corresponding to each product knowledge point are used as source data to construct a product knowledge structure graph based on a graph database, for example, an insurance product knowledge graph is constructed. At the same time, based on the different types of insurance product knowledge points extracted, different insurance product knowledge structure diagrams are correspondingly constructed.
步骤S250,基于所述产品知识图谱,创建所述问题集与所述产品适配表。Step S250, based on the product knowledge graph, create the question set and the product adaptation table.
本实施例中,基于构建的产品知识图谱中主体内容的不同,因此可创建不同产品知识的查询页面,以供用户进行不同产品知识内容检索。例如,既可以提供少儿健康保险知识点检索,还可以少儿教育保险等相似险种检索,从而为用户提供更全面高效的信息检索服务。In this embodiment, based on the difference of main contents in the constructed product knowledge graph, query pages for different product knowledge can be created for the user to retrieve different product knowledge contents. For example, it can provide not only the retrieval of knowledge points of children's health insurance, but also the retrieval of similar insurance types such as children's education insurance, so as to provide users with more comprehensive and efficient information retrieval services.
本实施例通过网络爬虫技术采集指定网站的产品文档信息,并对产品文档信息进行加工整理以形成产品知识结构图;然后再基于产品知识结构图,创建针对产品文档各知识点的问题集,以供进行多轮对话,收集用户信息。通过爬取方式采集产品文档信息,因而采集到的产品文档信息能够满足一般用户对于产品知识的查询需求。此外,本实施例将产品知识结构图作为产品查询服务的检索数据库,这不仅能够提供海量产品文档知识,同时还能为用户提供更高效快速的产品知识查询服务,进而优化了产品配置。In this embodiment, the product document information of the designated website is collected through the web crawler technology, and the product document information is processed and arranged to form a product knowledge structure diagram; For multiple rounds of conversations to collect user information. The product document information is collected by crawling, so the collected product document information can meet the query needs of general users for product knowledge. In addition, this embodiment uses the product knowledge structure diagram as the retrieval database of the product query service, which can not only provide massive product document knowledge, but also provide users with a more efficient and fast product knowledge query service, thereby optimizing product configuration.
本实施例中,产品适配表是依据用户年龄、工作性质、收入状况(当前及预期)、家庭状况、风险偏好、理财能力等多维度信息设计的比较适合用户购买的保险产品的集合,通过适配规则,根据用户的主观需求配置适合的产品推荐给用户。In this embodiment, the product adaptation table is a collection of insurance products that are more suitable for users to purchase based on multi-dimensional information such as the user's age, work nature, income status (current and expected), family status, risk preference, and financial ability. Adaptation rules, configure suitable products according to users' subjective needs and recommend them to users.
参照图5,图5为图2中步骤S120一实施例的细化流程示意图。基于上述实施例,本实施例中,上述步骤S120进一步包括:Referring to FIG. 5 , FIG. 5 is a schematic diagram of a refinement flow of an embodiment of step S120 in FIG. 2 . Based on the foregoing embodiment, in this embodiment, the foregoing step S120 further includes:
步骤S1201,将所述用户答复信息分割成多个数据区段;Step S1201, dividing the user reply information into multiple data sections;
本实施例中,将用户答复信息进行解析处理,将解析处理之后生成的数据信息分割成多个数据区段,分别对每一个数据区段进行情绪判断。比如,用户的答复信息为“寿险和少儿险两个,哪一个买给孩子比较合适呢,可以推荐一下吗”,将起分割为多个数据片段“寿险和少儿险两个”、“哪一个买给孩子比较合适呢”、“推荐”。In this embodiment, the user reply information is subjected to analysis processing, the data information generated after the analysis processing is divided into a plurality of data sections, and emotion judgment is performed on each data section respectively. For example, the user's reply information is "two life insurance and children's insurance, which one is more suitable for children to buy, can you recommend it?" It is more suitable to buy for children", "recommended".
步骤S1202,分别对所述数据区段进行情绪判断,得到各种情绪的程度属性数值;Step S1202, performing emotion judgment on the data sections respectively, and obtaining the degree attribute values of various emotions;
本实施例中,情绪类别可以包括但不限于开心、惊喜、正常、愤怒、厌烦、伤心等。分别对用户答复信息的每一个数据区段进行情绪判断,得到各种情绪对应的程度属性值。比如说,用户的答复信息为“寿险和少儿险两个,哪一个买给孩子比较合适呢,可以推荐一下吗”,将起分割为多个数据区段“寿险和少儿险两个”、“哪一个买给孩子比较合适呢”、“推荐”,分别对每一个数据区段进行情绪判断。In this embodiment, the emotion categories may include, but are not limited to, happy, surprised, normal, angry, bored, sad, and the like. The emotion judgment is performed on each data segment of the user's reply information respectively, and the degree attribute value corresponding to each emotion is obtained. For example, the user's reply information is "two life insurance and children's insurance, which one is more suitable for children to buy, can you recommend it?" Which one is more suitable for children?" and "recommended", and make emotional judgments for each data segment.
步骤S1203,基于所述数据区段情绪的程度属性数值,分别确定所述数据区段的情绪标签;Step S1203, based on the degree attribute value of the emotion of the data segment, respectively determine the emotion label of the data segment;
本实施例中,分别得到每一个数据区段中可能包括好几种情绪,同时,各种情绪的程度属性值是不同的,其中程度属性数值最大的对应的情绪标签即为为该数据区段的情绪标签。例如:可以得到“开心”情绪标签的程度属性值为0.522,“惊喜”情绪标签的程度属性值为0.1,“正常”情绪标签的程度属性值为0.3,“愤怒”情绪标签的程度属性值为0.01,“厌烦”情绪标签的程度属性值为0.01,“伤心”情绪标签的程度属性值为0.01,此时,可“开心”情绪标签的程度属性值最大,则该数据区段的情绪类别为开心。In this embodiment, it is obtained that each data segment may include several emotions, and at the same time, the degree attribute values of various emotions are different, and the corresponding emotion label with the largest degree attribute value is the data segment. emotional labels. For example, it can be obtained that the degree attribute value of the “happy” emotional label is 0.522, the degree attribute value of the “surprise” emotional label is 0.1, the degree attribute value of the “normal” emotional label is 0.3, and the degree attribute value of the “angry” emotional label is 0.3. 0.01, the degree attribute value of the “annoying” emotional label is 0.01, and the degree attribute value of the “sad” emotional label is 0.01. At this time, the degree attribute value of the “happy” emotional label is the largest, then the emotional category of this data segment is happy.
步骤S1204,基于所述各数据区段的情绪标签,输出所述用户答复信息中各数据区段对应的情绪标签概率分布图;Step S1204, based on the emotional labels of the data sections, output the probability distribution map of the emotional labels corresponding to each data section in the user reply information;
步骤S1205,基于所述情绪标签的概率分布图,得到所述用户情绪。Step S1205, obtaining the user's emotion based on the probability distribution map of the emotion label.
本实施例中,根据各数据区段的情绪标签,输出该用户答复信息中各数据区段对应的情绪标签的概率分布图。例如:可以得到用户答复信息中的各数据区段的情绪标签分别为,“开心”情绪标签的概率为0.522,“惊喜”情绪标签的概率为0.12,“正常”情绪标签的概率为0.32,“愤怒”情绪标签的概率为0.018,“厌烦”情绪标签的概率为0.01,“伤心”情绪标签的概率为0.01,此时,可将各个情绪标签的概率进行阈值判断,并根据该阈值判断结果从这些情绪标签中确定出该用户的情绪类别。若所有情绪标签的概率均小于预设阈值,则将概率最大的情绪标签作为用户情绪输出。In this embodiment, according to the emotion tags of each data segment, a probability distribution map of emotion tags corresponding to each data segment in the user reply information is output. For example, the emotional labels of each data segment in the user's reply information can be obtained, respectively, the probability of the "happy" emotional label is 0.522, the probability of the "surprise" emotional label is 0.12, the probability of the "normal" emotional label is 0.32, " The probability of the emotional label of "anger" is 0.018, the probability of the emotional label of "annoying" is 0.01, and the probability of the emotional label of "sad" is 0.01. At this time, the probability of each emotional label can be thresholded, and according to the threshold judgment result from The emotion categories of the user are determined from these emotion labels. If the probabilities of all emotion tags are less than the preset threshold, the emotion tag with the highest probability is used as the user emotion output.
另外,该方法不仅可以识别出用户情绪,还可以有利于后续对话交互流程,使系统能对用户作出不同的情感反馈,比如说,如果判断用户情绪消极,则可以在多轮对话中对问题进行委婉的二次确认,从而获取用户真实的需求,避免被不真实的信息误导。In addition, this method can not only identify the user's emotions, but also facilitate the subsequent dialogue interaction process, so that the system can give different emotional feedback to the user. Euphemistic secondary confirmation, so as to obtain the real needs of users and avoid being misled by untrue information.
参照图6,图6为图2中步骤S130的细化流程示意图。基于上述实施例,本实施例中,上述步骤S130进一步包括:Referring to FIG. 6 , FIG. 6 is a schematic diagram of a refinement flow of step S130 in FIG. 2 . Based on the foregoing embodiment, in this embodiment, the foregoing step S130 further includes:
步骤S1301,对所述用户答复信息以词为单位进行分析得到命名实体作为用户意图参数候选;Step S1301, analyzing the user reply information in word units to obtain a named entity as a user intent parameter candidate;
本实施例中,根据用户答复信息的具体数据内容,并以词为单位进行意图分析,得到对应的命名实体,作为用户意图的参数候选。具体实现步骤如下,比如说“给我妈妈购买一份重疾险”,其中,分词、词性标注、命名实体识别:给/p我/r妈妈/n购买/v一/m份/q重疾险/n,命名实体集合为空。获得意图参数候选:由分词和词性标注结果可知,一份重疾险为数词m+量词q+名词n组合,且该名词“重疾险”归于保险产品意图参数,则可置保险=一份重疾险。In this embodiment, according to the specific data content of the user's reply information, intention analysis is performed in units of words, and a corresponding named entity is obtained, which is used as a parameter candidate of the user's intention. The specific implementation steps are as follows, for example, "buy a critical illness insurance for my mother", in which word segmentation, part-of-speech tagging, and named entity recognition: for /pme/rmum/nbuy/vone/m copies/qcritical illness risk/n, the named entity collection is empty. Obtaining intent parameter candidates: From the results of word segmentation and part-of-speech tagging, it can be seen that a critical illness insurance is a combination of numeral m + quantifier q + noun n, and the noun "critical illness insurance" is attributed to the insurance product intent parameter, then insurance = a critical illness insurance risk.
步骤S1302,对所述用户答复信息进行解析,并根据预设的用户意图关键候选集逐词模糊匹配,得到意图关键词;Step S1302, parse the user reply information, and fuzzy match word by word according to a preset user intent key candidate set to obtain intent keywords;
本实施例中,对用户答复信息进行解析,并根据预设的用户意图关键候选集逐词模糊匹配,得到意图关键词。比如说“给我妈妈购买一份重疾险”,依存句法分析:0:给/p 3:ADV 1:我/r 2:ATT 2:妈妈/n 0:POB 3:买/v-1:HED 4:一/m 5:ATT 5:份/q 6:ATT 6:重疾险/n 3:VOB。In this embodiment, the user reply information is parsed, and the intent keywords are obtained by word-by-word fuzzy matching according to a preset user intent key candidate set. For example, "buy a critical illness insurance for my mother", dependency parsing: 0: give/p 3: ADV 1: me/r 2: ATT 2: mother/n 0: POB 3: buy/v-1: HED 4: one/m 5: ATT 5: copies/q 6: ATT 6: critical illness insurance/n 3: VOB.
获得意图关键词:将整理出的买重疾险意图关键词(例如“购买一份重疾险”)同用户答复信息的分词结果进行相似度比较,计算可得cosine(购买,一份重疾险)=1.89,符合相似要求,将“购买”作为意图关键词。Obtaining intent keywords: Compare the similarity between the sorted out intent keywords for buying critical illness insurance (for example, "buy a critical illness insurance") and the word segmentation results of the user's reply information, and calculate to get cosine (purchase, a critical illness insurance) insurance) = 1.89, meeting similar requirements, and using "purchase" as the intent keyword.
步骤S1303,基于所述意图关键词输出用户意图识别结果;Step S1303, output the user intent identification result based on the intent keyword;
本实施例中,根据得到的意图关键词,判断用户意图参数候选之间的依存关系。由依存句法分析的结果可知,意图参数一份重疾险和意图关键词“购买”之间存在依存关系(购买,一份重疾险,VOB),可以得出用户意图为“购买重疾险”In this embodiment, the dependency relationship between the user intent parameter candidates is determined according to the obtained intent keywords. From the results of the dependency syntax analysis, it can be seen that there is a dependency between the intent parameter a critical illness insurance and the intent keyword "purchase" (purchase, a critical illness insurance, VOB), and it can be concluded that the user's intention is "purchase critical illness insurance". risk"
步骤S1304,基于所述用户意图识别结果和所述用户情绪,确定所述用户的购买意图。Step S1304: Determine the purchase intention of the user based on the user intention identification result and the user emotion.
本实施例中,根基用户的基本意图信息和用户的情绪,确定用户深层次的需求,也即用户的购买意图。基本意图信息对应的是用户答复数据所直观反映出的意图,但并无法反映用户当前状态下的真实需求,因此才需要结合用户的情绪来综合确定用户答复信息所实际想要表达的真实购买意图和情绪需求。比如说“我要买保险,重疾险比较适合我,还是寿险比较适合我?”和“我要购买保险的话,重疾险比较适合我,寿险也适合我吗”虽然二者内容差不多,但是因为情绪不同,用户想表达的真实购买需求是完全不相同的。In this embodiment, based on the user's basic intention information and the user's emotion, the user's deep-level needs, that is, the user's purchase intention, are determined. The basic intent information corresponds to the intent intuitively reflected by the user's reply data, but it cannot reflect the user's real needs in the current state. Therefore, it is necessary to combine the user's emotions to comprehensively determine the real purchase intent that the user's reply information actually wants to express. and emotional needs. For example, "I want to buy insurance, is critical illness insurance more suitable for me, or is life insurance more suitable for me?" and "If I want to buy insurance, is critical illness insurance more suitable for me, and life insurance is also suitable for me?" With different emotions, the real purchasing needs that users want to express are completely different.
参照图7,图7为图2中步骤S140一实施例的细化流程示意图。基于上述实施例,本实施例中,上述步骤S140进一步包括:Referring to FIG. 7 , FIG. 7 is a schematic diagram of a refinement flow of an embodiment of step S140 in FIG. 2 . Based on the foregoing embodiment, in this embodiment, the foregoing step S140 further includes:
步骤S1401,将所述用户信息转译为标签格式,生成所述用户信息对应的多个标签;Step S1401, translating the user information into a label format, and generating multiple labels corresponding to the user information;
本实施例中,将用户信息按照预先设置好的标签格式进行处理,生成标签格式的字段信息,其中,生成的每一个标签格式的字段信息,就是标签。其中,生成的多个标签组成的集合,就叫标签集。比如,用户答复信息为“我叫李四,性别女,是一名教师,今年43岁,年收入15万元,家庭资产大概有200万,想买一份重疾险”,其中对各个标签赋值,“性别”标签为2,“年龄”标签为,6,“职业”标签为5,“年收入”标签为9,“家庭资产”标签为13,分别将其转译为标签形式,则为李四={2,6,5,9,13,}。In this embodiment, the user information is processed according to the preset label format to generate field information in the label format, wherein each generated field information in the label format is a label. Among them, the generated set of multiple tags is called tag set. For example, the user's reply message is "My name is Li Si, gender is female, I am a teacher, I am 43 years old this year, my annual income is 150,000 yuan, and my family assets are about 2 million yuan. I want to buy a critical illness insurance." Assignment, the "gender" label is 2, the "age" label is 6, the "occupation" label is 5, the "annual income" label is 9, and the "family assets" label is 13, and they are translated into label forms, then Li Si = {2,6,5,9,13,}.
步骤S1402,若所述用户购买意图为正向意图,则基于所述标签的预设权重值,分别将所述各标签与预置产品适配表中的产品进行匹配,计算并输出各标签与对应产品的匹配得分;Step S1402, if the purchase intention of the user is a positive intention, then based on the preset weight value of the tag, the tags are respectively matched with the products in the preset product adaptation table, and each tag is calculated and output. The matching score of the corresponding product;
本实施例中,若用户的购买意图为正向意图,则根据各个标签的权重值,分别将各个标签与产品适配表中的多个保险产品进行匹配,计算每个标签与产品适配表中每种产品的匹配得分。比如,将用户答复信息“我叫李四,性别女,是一名教师,今年43岁,年收入15万元,家庭资产大概有200万,想买一份重疾险”转译为标签形式,则为李四={2,6,5,9,13},分别将标签2,标签6,标签5,标签9,标签13与预置产品适配表中的产品A,产品B,产品C,产品D,产品E进行匹配,根据各标签的权重值,计算并输出各标签与对应产品的匹配得分。In this embodiment, if the user's purchase intention is a positive intention, each tag is matched with multiple insurance products in the product adaptation table according to the weight value of each tag, and each tag and product adaptation table is calculated. Match score for each product in . For example, the user's reply message "My name is Li Si, gender is female, I am a teacher, I am 43 years old this year, my annual income is 150,000 yuan, and my family assets are about 2 million yuan. I want to buy a critical illness insurance" into a label format, Then it is Li Si = {2, 6, 5, 9, 13}, respectively, label 2, label 6, label 5, label 9, label 13 and product A, product B, product C in the preset product adaptation table , product D, and product E are matched, and according to the weight value of each label, the matching score between each label and the corresponding product is calculated and output.
步骤S1403,基于所述各标签与对应产品的匹配得分,将所述各标签与对应产品的匹配得分进行求和,得到各产品的匹配程度得分;Step S1403, based on the matching score of each label and the corresponding product, summing the matching score of each label and the corresponding product to obtain the matching degree score of each product;
本实施例中,根据每个标签与每种产品的匹配得分,将每个标签的匹配得分进行求和,得到产品适配表中每种产品的匹配程度的分。比如说,用户答复信息“我叫李四,性别女,是一名教师,今年43岁,年收入15万元,家庭资产大概有200万,想买一份重疾险”转译为标签形式,则为李四={2,6,5,9,13},分别将标签2,标签6,标签5,标签9,标签13,将各标签与预置产品适配表中的产品A进行匹配,匹配得分分别为5,7,2,11,7,将这些标签的匹配得分进行求和,所以产品A的匹配程度的分为5+7+2+11+7=32,依次得到预置产品适配表中所有产品的匹配程度的分。In this embodiment, according to the matching scores of each label and each product, the matching scores of each label are summed to obtain the score of the matching degree of each product in the product adaptation table. For example, the user's reply message "My name is Li Si, gender is female, I am a teacher, I am 43 years old this year, my annual income is 150,000 yuan, and my family assets are about 2 million yuan. I want to buy a critical illness insurance" and translate it into a label. Then Li Si = {2, 6, 5, 9, 13}, respectively label 2, label 6, label 5, label 9, label 13, and match each label with product A in the preset product adaptation table , the matching scores are 5, 7, 2, 11, 7, respectively, and the matching scores of these tags are summed, so the matching degree of product A is divided into 5+7+2+11+7=32, and the presets are obtained in turn Score for the degree of fit of all products in the product fit table.
步骤S1404,若所述产品的匹配程度得分高于预设阈值,则将所述产品标记为推荐,若否,则标记为非推荐;Step S1404, if the matching degree score of the product is higher than a preset threshold, mark the product as recommended, if not, mark it as non-recommended;
本实施例中,根据各个产品的匹配度得分,若产品匹配度得分高于预设值,则将该产品标记为推荐,若产品匹配度得分低于预设值,则标记为非推荐。比如,预置产品适配表中的所有产品的匹配得分分别为35,47,55,55,85,若产品的匹配程度得分高于预设阈值,则将该产品标记为推荐。若否,则标记为非推荐。In this embodiment, according to the matching degree score of each product, if the product matching degree score is higher than the preset value, the product is marked as recommended, and if the product matching degree score is lower than the preset value, it is marked as non-recommended. For example, the matching scores of all products in the preset product adaptation table are 35, 47, 55, 55, and 85, respectively. If the matching degree score of the product is higher than the preset threshold, the product is marked as recommended. If not, mark as non-recommended.
骤S1405,将标记为推荐的预设数量个产品作为推荐结果,并输出。In step S1405, the preset number of products marked as recommended are taken as the recommended results, and output.
本实施例中,将标记为推荐的一定数量个产品作为推荐结果,发送至客户端,呈现给用户。比如,预置产品适配表中的所有产品产品A,产品B,产品C,产品D,产品E的匹配得分分别为35,47,55,55,85,若产品的匹配程度得分高于预设阈值50,则将产品C,产品D,产品E标记为推荐,并将产品C,产品D,产品E推荐给用户。In this embodiment, a certain number of products marked as recommended are sent to the client as the recommendation results and presented to the user. For example, the matching scores of all products Product A, Product B, Product C, Product D, and Product E in the preset product adaptation table are 35, 47, 55, 55, and 85, respectively. If the threshold is set to 50, product C, product D, and product E are marked as recommended, and product C, product D, and product E are recommended to users.
参照图8,图8为本发明产品智能匹配装置一实施例的功能模块示意图。本实施例中,所述产品智能匹配装置包括:Referring to FIG. 8 , FIG. 8 is a schematic diagram of functional modules of an embodiment of an intelligent product matching device of the present invention. In this embodiment, the product intelligent matching device includes:
获取模块10,用于基于预置问题集与用户进行多轮对话,并获取每一轮对话的用户答复信息,以收集用户信息;The acquiring
情绪识别模块20,用于对所述用户答复信息进行情绪识别,得到当前用户情绪;The
意图识别模块30,对所述用户答复信息进行意图识别,并基于所述用户情绪,确定所述用户购买意图;an
产品推荐模块40,用于当用户的意图为正向意图时,基于所述用户信息,查询预置产品适配表,输出与所述用户信息匹配的产品。The
可选地,在一具体实施例中,所述产品智能匹配装置还包括:Optionally, in a specific embodiment, the product intelligent matching device further includes:
采集模块,用于通过网络爬虫技术,采集指定网站显示的预置格式的所有产品文档信息;The collection module is used to collect all the product document information in the preset format displayed on the designated website through the web crawler technology;
构建模块,用于对所述产品文档信息进行实体关系抽取处理,得到所述产品文档信息中各产品知识点的命名实体对象以及命名实体对象之间的实体关系;根据各产品知识点的命名实体对象,采用正则表达式识别所述产品文档信息中各命名实体对象对应的实体属性;以各产品知识点对应的命名实体对象、实体属性以及实体关系为源数据,构建基于图数据库的产品知识图谱;The building module is used to perform entity relationship extraction processing on the product document information, and obtain the named entity objects of each product knowledge point in the product document information and the entity relationship between the named entity objects; according to the named entity objects of each product knowledge point object, using regular expressions to identify the entity attributes corresponding to each named entity object in the product document information; using the named entity objects, entity attributes and entity relationships corresponding to each product knowledge point as source data to build a graph database-based product knowledge map ;
创建模块,用于基于所述产品知识图谱,创建所述问题集与所述产品适配表。A creation module is configured to create the question set and the product adaptation table based on the product knowledge graph.
可选地,在一具体实施例中,所述获取模块包括:Optionally, in a specific embodiment, the obtaining module includes:
获取单元,用于获取每一轮对话的用户答复信息;an acquisition unit, used to acquire the user's reply information for each round of dialogue;
槽位匹配单元,用于将所述用户答复信息与预置问题集中的预设槽位进行匹配;a slot matching unit, configured to match the user reply information with a preset slot in a preset question set;
抽取单元,用于若所述用户答复信息与所述预设槽位匹配成功,则抽取所述预设槽位;an extraction unit, configured to extract the preset slot if the user reply information is successfully matched with the preset slot;
关联单元,用于若所述用户答复信息中存在与所述预设槽位匹配的答案信息,则将所述答案信息与所述预设槽位关联;an association unit, configured to associate the answer information with the preset slot if there is answer information matching the preset slot in the user reply information;
收集单元,用于基于所述关联结果,收集用户信息。A collection unit, configured to collect user information based on the association result.
可选地,在一具体实施例中,所述情绪识别模块包括:Optionally, in a specific embodiment, the emotion recognition module includes:
分割单元,用于将所述用户答复信息分割成多个数据区段;a dividing unit, configured to divide the user reply information into a plurality of data sections;
判断单元,用于分别对所述数据区段进行情绪判断,得到各种情绪的程度属性数值;a judging unit, which is used to judge the emotions of the data sections respectively, and obtain the degree attribute values of various emotions;
情绪识别单元,用于基于所述各数据区段情绪的程度属性数值,分别确定所述数据区段的情绪标签;基于所述各数据区段的情绪标签,输出各情绪对应的情绪标签概率分布图;基于所述情绪标签概率分布图,得到所述用户情绪。An emotion recognition unit, configured to respectively determine the emotion labels of the data segments based on the degree attribute value of the emotion of each data segment; output the emotion label probability distribution corresponding to each emotion based on the emotion labels of the various data segments Figure; based on the emotion label probability distribution map, the user emotion is obtained.
可选地,在一具体实施例中,所述意图识别模块包括:Optionally, in a specific embodiment, the intent recognition module includes:
分析单元,用于对所述用户答复信息以词为单位进行分析得到命名实体作为用户意图参数候选;an analysis unit, configured to analyze the user reply information in word units to obtain a named entity as a user intent parameter candidate;
意图匹配单元,用于对所述用户答复信息进行解析,并根据预设的用户意图关键候选集逐词模糊匹配,得到意图关键词;an intent matching unit, configured to parse the user reply information, and obtain intent keywords by word-by-word fuzzy matching according to a preset user intent key candidate set;
意图识别单元,用于基于所述意图关键词输出用户意图识别结果;基于所述用户意图识别结果和所述用户情绪,确定所述用户的购买意图。An intent identification unit, configured to output a user intent identification result based on the intent keyword; and determine the user's purchase intent based on the user intent identification result and the user emotion.
可选地,在一具体实施例中,所述产品推荐模块包括:Optionally, in a specific embodiment, the product recommendation module includes:
格式转译单元,用于将所述用户信息转译为标签格式,生成所述用户信息对应的多个标签;a format translation unit, configured to translate the user information into a label format, and generate a plurality of labels corresponding to the user information;
计算单元,用于若所述用户购买意图为正向意图,则基于所述标签的预设权重值,分别将所述各标签与预置产品适配表中的产品进行匹配,计算并输出各标签与对应产品的匹配得分;基于所述各标签与对应产品的匹配得分,将所述各标签与对应产品的匹配得分进行求和,得到各产品的匹配程度得分;The calculation unit is configured to, if the purchase intention of the user is a positive intention, respectively match the tags with the products in the preset product adaptation table based on the preset weight values of the tags, calculate and output each tag. The matching score of the label and the corresponding product; based on the matching score of each label and the corresponding product, the matching score of each label and the corresponding product is summed to obtain the matching degree score of each product;
产品推荐单元,用于若所述产品的匹配程度得分高于预设阈值,则将所述产品标记为推荐,若否,则标记为非推荐;将标记为推荐的预设数量个产品作为推荐结果并输出。A product recommendation unit, configured to mark the product as recommended if the matching degree score of the product is higher than a preset threshold, and if not, mark the product as non-recommended; take a preset number of products marked as recommended as recommendations result and output.
本实施例提出的产品智能匹配方法,完善了目前市面上存在的所谓的智能保险助手业务模式和技术方面的缺陷,提供了以用户信息和用户需求为基准的产品智能匹配方法。通过网络爬虫技术采集指定网站的产品文档信息,并对产品文档信息进行加工整理,构建针对产品知识点的问题集,以供进行多轮对话收集用户信息从而避免简单的正向收集用户信息时可能造成的信息缺失或对保险理解不一致。同时,基于多个维度收集用户个性化信息,通过预设模型的知识图谱问答解决用户需求和产品匹配的问题。通过用户与产品适配模型匹配后,能进行精准的产品推荐。The product intelligent matching method proposed in this embodiment improves the current market deficiencies in the so-called intelligent insurance assistant business model and technology, and provides a product intelligent matching method based on user information and user needs. Collect the product document information of the designated website through the web crawler technology, process the product document information, and build a problem set for product knowledge points for multiple rounds of dialogue to collect user information, so as to avoid the possibility of simple positive collection of user information. The resulting lack of information or inconsistent understanding of insurance. At the same time, it collects user personalized information based on multiple dimensions, and solves the problem of user needs and product matching through the knowledge graph question and answer of the preset model. After the user is matched with the product adaptation model, accurate product recommendation can be made.
基于与上述本发明产品智能匹配方法相同的实施例说明内容,因此本实施例对产品智能匹配装置的实施例内容不做过多赘述。Based on the description content of the above-mentioned embodiment of the intelligent product matching method of the present invention, the content of the embodiment of the product intelligent matching device will not be described in detail in this embodiment.
本发明还提供一种计算机可读存储介质。The present invention also provides a computer-readable storage medium.
本实施例中,所述计算机可读存储介质上存储有产品智能匹配程序,所述产品智能匹配程序被处理器执行时实现如上述任一项实施例中所述的产品智能匹配方法的步骤。其中,产品智能匹配程序被处理器执行时所实现的方法可参照本发明产品智能匹配方法的各个实施例,因此不再过多赘述。In this embodiment, a product intelligent matching program is stored on the computer-readable storage medium, and when the product intelligent matching program is executed by the processor, the steps of the product intelligent matching method described in any of the above embodiments are implemented. The method implemented when the product intelligent matching program is executed by the processor may refer to the various embodiments of the product intelligent matching method of the present invention, and thus will not be described repeatedly.
通过以上的实施方式的描述,本领域的技术人员可以清楚地了解到上述实施例方法可借助软件加必需的通用硬件平台的方式来实现,当然也可以通过硬件,但很多情况下前者是更佳的实施方式。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质(如ROM/RAM)中,包括若干指令用以使得一台终端(可以是手机,计算机,服务器或者网络设备等)执行本发明各个实施例所述的方法。From the description of the above embodiments, those skilled in the art can clearly understand that the method of the above embodiment can be implemented by means of software plus a necessary general hardware platform, and of course can also be implemented by hardware, but in many cases the former is better implementation. Based on this understanding, the technical solutions of the present invention essentially or the parts that contribute to the prior art can be embodied in the form of software products, and the computer software products are stored in a storage medium (such as ROM/RAM), including Several instructions are used to cause a terminal (which may be a mobile phone, a computer, a server, or a network device, etc.) to execute the methods described in the various embodiments of the present invention.
上面结合附图对本发明的实施例进行了描述,但是本发明并不局限于上述的具体实施方式,上述的具体实施方式仅仅是示意性的,而不是限制性的,本领域的普通技术人员在本发明的启示下,在不脱离本发明宗旨和权利要求所保护的范围情况下,还可做出很多形式,凡是利用本发明说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,这些均属于本发明的保护之内。The embodiments of the present invention have been described above in conjunction with the accompanying drawings, but the present invention is not limited to the above-mentioned specific embodiments, which are merely illustrative rather than restrictive. Under the inspiration of the present invention, without departing from the scope of protection of the purpose of the present invention and the claims, many forms can be made. Directly or indirectly applied in other related technical fields, these all belong to the protection of the present invention.
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010090301.3A CN111339284A (en) | 2020-02-13 | 2020-02-13 | Product intelligent matching method, device, device and readable storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010090301.3A CN111339284A (en) | 2020-02-13 | 2020-02-13 | Product intelligent matching method, device, device and readable storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111339284A true CN111339284A (en) | 2020-06-26 |
Family
ID=71183572
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010090301.3A Pending CN111339284A (en) | 2020-02-13 | 2020-02-13 | Product intelligent matching method, device, device and readable storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111339284A (en) |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112149429A (en) * | 2020-10-21 | 2020-12-29 | 成都小美伴旅信息技术有限公司 | High-accuracy semantic understanding and identifying method based on word slot order model |
CN112233699A (en) * | 2020-10-13 | 2021-01-15 | 中移(杭州)信息技术有限公司 | Voice broadcast method, intelligent voice device and computer-readable storage medium |
CN112528002A (en) * | 2020-12-23 | 2021-03-19 | 北京百度网讯科技有限公司 | Dialog recognition method and device, electronic equipment and storage medium |
CN113010784A (en) * | 2021-03-17 | 2021-06-22 | 北京十一贝科技有限公司 | Method, apparatus, electronic device, and medium for generating prediction information |
CN113256437A (en) * | 2021-07-15 | 2021-08-13 | 大童保险销售服务有限公司 | Method, system and storage medium for configuring disease insurance product scheme |
CN113888203A (en) * | 2021-09-04 | 2022-01-04 | 北京优全智汇信息技术有限公司 | Online insurance product sale system and sale method |
CN113971616A (en) * | 2021-11-26 | 2022-01-25 | 北京优全智汇信息技术有限公司 | User demand-based insurance product accurate selection system and selection method |
CN116737901A (en) * | 2023-06-14 | 2023-09-12 | 北京安博通科技股份有限公司 | Intelligent question and answer method, electronic equipment and media based on network security equipment questions |
CN117235234A (en) * | 2023-11-08 | 2023-12-15 | 深圳市腾讯计算机系统有限公司 | Object information acquisition method, device, computer equipment and storage medium |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109146610A (en) * | 2018-07-16 | 2019-01-04 | 众安在线财产保险股份有限公司 | It is a kind of intelligently to insure recommended method, device and intelligence insurance robot device |
CN109727092A (en) * | 2018-12-15 | 2019-05-07 | 深圳壹账通智能科技有限公司 | AI-based product recommendation method, device, computer equipment and storage medium |
CN109949071A (en) * | 2019-01-31 | 2019-06-28 | 平安科技(深圳)有限公司 | Products Show method, apparatus, equipment and medium based on voice mood analysis |
CN110738545A (en) * | 2019-08-30 | 2020-01-31 | 深圳壹账通智能科技有限公司 | Product recommendation method and device based on user intention identification, computer equipment and storage medium |
-
2020
- 2020-02-13 CN CN202010090301.3A patent/CN111339284A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109146610A (en) * | 2018-07-16 | 2019-01-04 | 众安在线财产保险股份有限公司 | It is a kind of intelligently to insure recommended method, device and intelligence insurance robot device |
CN109727092A (en) * | 2018-12-15 | 2019-05-07 | 深圳壹账通智能科技有限公司 | AI-based product recommendation method, device, computer equipment and storage medium |
CN109949071A (en) * | 2019-01-31 | 2019-06-28 | 平安科技(深圳)有限公司 | Products Show method, apparatus, equipment and medium based on voice mood analysis |
CN110738545A (en) * | 2019-08-30 | 2020-01-31 | 深圳壹账通智能科技有限公司 | Product recommendation method and device based on user intention identification, computer equipment and storage medium |
Cited By (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112233699B (en) * | 2020-10-13 | 2023-04-28 | 中移(杭州)信息技术有限公司 | Voice broadcasting method, intelligent voice equipment and computer readable storage medium |
CN112233699A (en) * | 2020-10-13 | 2021-01-15 | 中移(杭州)信息技术有限公司 | Voice broadcast method, intelligent voice device and computer-readable storage medium |
CN112149429A (en) * | 2020-10-21 | 2020-12-29 | 成都小美伴旅信息技术有限公司 | High-accuracy semantic understanding and identifying method based on word slot order model |
CN112528002A (en) * | 2020-12-23 | 2021-03-19 | 北京百度网讯科技有限公司 | Dialog recognition method and device, electronic equipment and storage medium |
CN112528002B (en) * | 2020-12-23 | 2023-07-18 | 北京百度网讯科技有限公司 | Dialogue identification method, device, electronic equipment and storage medium |
CN113010784A (en) * | 2021-03-17 | 2021-06-22 | 北京十一贝科技有限公司 | Method, apparatus, electronic device, and medium for generating prediction information |
CN113010784B (en) * | 2021-03-17 | 2024-02-06 | 北京十一贝科技有限公司 | Method, apparatus, electronic device and medium for generating prediction information |
CN113256437B (en) * | 2021-07-15 | 2021-11-02 | 大童保险销售服务有限公司 | Method, system and storage medium for configuring disease insurance product scheme |
CN113256437A (en) * | 2021-07-15 | 2021-08-13 | 大童保险销售服务有限公司 | Method, system and storage medium for configuring disease insurance product scheme |
CN113888203A (en) * | 2021-09-04 | 2022-01-04 | 北京优全智汇信息技术有限公司 | Online insurance product sale system and sale method |
CN113971616A (en) * | 2021-11-26 | 2022-01-25 | 北京优全智汇信息技术有限公司 | User demand-based insurance product accurate selection system and selection method |
CN116737901A (en) * | 2023-06-14 | 2023-09-12 | 北京安博通科技股份有限公司 | Intelligent question and answer method, electronic equipment and media based on network security equipment questions |
CN117235234A (en) * | 2023-11-08 | 2023-12-15 | 深圳市腾讯计算机系统有限公司 | Object information acquisition method, device, computer equipment and storage medium |
CN117235234B (en) * | 2023-11-08 | 2024-03-01 | 深圳市腾讯计算机系统有限公司 | Object information acquisition method, device, computer equipment and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111339284A (en) | Product intelligent matching method, device, device and readable storage medium | |
CN106776711B (en) | Chinese medical knowledge map construction method based on deep learning | |
US10528329B1 (en) | Methods, systems, and computer program product for automatic generation of software application code | |
RU2657173C2 (en) | Sentiment analysis at the level of aspects using methods of machine learning | |
US11699034B2 (en) | Hybrid artificial intelligence system for semi-automatic patent infringement analysis | |
US8972408B1 (en) | Methods, systems, and articles of manufacture for addressing popular topics in a social sphere | |
RU2635257C1 (en) | Sentiment analysis at level of aspects and creation of reports using machine learning methods | |
US20170103439A1 (en) | Searching Evidence to Recommend Organizations | |
CN116244344B (en) | Retrieval method, device and electronic equipment based on user needs | |
US10740406B2 (en) | Matching of an input document to documents in a document collection | |
US20220414463A1 (en) | Automated troubleshooter | |
CN111242710A (en) | Business classification processing method and device, service platform and storage medium | |
US8862609B2 (en) | Expanding high level queries | |
CN113407677A (en) | Method, apparatus, device and storage medium for evaluating quality of consultation session | |
CN116127013A (en) | A personal sensitive information knowledge map query method and device | |
CN119646016A (en) | Data query method, device, electronic device, medium and program product | |
CN119474270A (en) | Message interaction method and device based on AI customer service robot | |
CN119228386A (en) | Optimization method, system, device and medium of intelligent customer service system | |
CN119322831A (en) | AI-based generation type question-answering method and device | |
CN118885587A (en) | Question and answer processing method, device and non-volatile storage medium | |
CN118760733A (en) | A text processing method, device, electronic device, computer-readable storage medium and computer program product | |
US20210263915A1 (en) | Search Text Generation System and Search Text Generation Method | |
US11017172B2 (en) | Proposition identification in natural language and usage thereof for search and retrieval | |
Lee | Use-centric mining of customer reviews | |
CN112558913A (en) | Conversation method and device based on aggregated card, computer equipment and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |