[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN106875278A - Social network user portrait method based on random forest - Google Patents

Social network user portrait method based on random forest Download PDF

Info

Publication number
CN106875278A
CN106875278A CN201710038836.4A CN201710038836A CN106875278A CN 106875278 A CN106875278 A CN 106875278A CN 201710038836 A CN201710038836 A CN 201710038836A CN 106875278 A CN106875278 A CN 106875278A
Authority
CN
China
Prior art keywords
attribute
random forest
social network
label
similarity
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710038836.4A
Other languages
Chinese (zh)
Other versions
CN106875278B (en
Inventor
琚春华
胡坤
鲍福光
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang Gongshang University
Original Assignee
Zhejiang Gongshang University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang Gongshang University filed Critical Zhejiang Gongshang University
Priority to CN201710038836.4A priority Critical patent/CN106875278B/en
Publication of CN106875278A publication Critical patent/CN106875278A/en
Application granted granted Critical
Publication of CN106875278B publication Critical patent/CN106875278B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/01Social networking

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • Economics (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Computing Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Tourism & Hospitality (AREA)
  • Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Image Analysis (AREA)

Abstract

本发明提供了一种基于随机森林的社交网络用户画像方法,具体包括如下步骤:获取在线社交网站的多源属性数据;将原始多源属性的数据属性集合进行原始属性标号,调用相似度函数对不同属性的数据属性集合遍历相似检测;根据原始单层多源属性的决策树,将相似度满足阈值范围的数据属性集合合并生成合并属性标签后,采用随机森林算法训练样本;获取投票众数,将获得的投票众数赋予权重,再按照权重的由大到小排序,获取全部的标签权重值;保留预设阈值内的标签,形成新的标签属性集用于用户社交网络中属性的画像。本发明目的在于采用随机森林模型,用于用户的属性标签划分,有效改进了传统的基于小样本抽样划分属性的不足和复杂度的问题。

The present invention provides a social network user portrait method based on random forest, which specifically includes the following steps: obtaining multi-source attribute data of online social networking sites; labeling the data attribute sets of original multi-source attributes with original attribute labels, and calling similarity function pairs The data attribute sets of different attributes are traversed for similarity detection; according to the decision tree of the original single-layer multi-source attribute, the data attribute sets whose similarity meets the threshold range are merged to generate the merged attribute label, and the random forest algorithm is used to train the samples; the voting majority is obtained, Assign weights to the obtained voting majority, and then sort according to the weights from large to small to obtain all tag weight values; retain tags within the preset threshold to form a new tag attribute set for the portrait of attributes in the user's social network. The purpose of the present invention is to adopt a random forest model for user's attribute label division, which effectively improves the problem of deficiency and complexity of the traditional attribute division based on small sample sampling.

Description

基于随机森林的社交网络用户画像方法Random forest based user portrait method for social network

技术领域technical field

本发明涉及在线社会网络技术领域,特别涉及一种基于随机森林的社交网络用户画像方法。The invention relates to the technical field of online social networks, in particular to a random forest-based social network user portrait method.

背景技术Background technique

在线社会网络的研究是近年来学术研究的重点领域,我国有着世界上规模最大的互联网网民,因此,在互联网的前期推广阶段和现阶段的使用过程中产生了大量的数据。绝大多数的数据资源被闲置,不能很好的处理和商业化应用,造成巨大的损失,同时也不利于社交网络的进一步发展,各大互联网公司纷纷投入巨大的财力和人力对在线社会关系领域开展一系列研究,把互联网的数据资源合理的开发和使用意义重大。The research on online social network is the key field of academic research in recent years. my country has the largest Internet users in the world. Therefore, a large amount of data has been generated in the early stage of Internet promotion and current use. The vast majority of data resources are idle and cannot be well processed and commercialized, causing huge losses, and it is not conducive to the further development of social networks. Major Internet companies have invested huge financial resources and manpower in the field of online social relations. It is of great significance to carry out a series of researches on the rational development and use of Internet data resources.

发明内容Contents of the invention

本发明提供一种基于随机森林的社交网络用户画像方法,目的在于采用随机森林模型,用于用户的属性标签划分,有效改进了传统的基于小样本抽样划分属性的不足和复杂度的问题。The present invention provides a social network user portrait method based on random forest, aiming at adopting random forest model for user's attribute label division, effectively improving the problem of deficiency and complexity of traditional attribute division based on small sample sampling.

为解决上述问题,本发明实施例提供一种基于随机森林的社交网络用户画像方法,具体包括如下步骤:In order to solve the above problems, an embodiment of the present invention provides a random forest-based social network user portrait method, which specifically includes the following steps:

获取在线社交网站的多源属性数据;Obtain multi-source attribute data for online social networking sites;

将原始多源属性的数据属性集合进行原始属性标号,调用相似度函数对不同属性的数据属性集合遍历相似检测;Label the data attribute collection of the original multi-source attribute with the original attribute label, and call the similarity function to traverse the similarity detection for the data attribute collection of different attributes;

根据原始单层多源属性的决策树,将相似度满足阈值范围的数据属性集合合并生成合并属性标签后,采用随机森林算法训练样本;According to the decision tree of the original single-layer multi-source attributes, the data attribute sets whose similarity meets the threshold range are merged to generate the merged attribute labels, and the random forest algorithm is used to train the samples;

获取投票众数,将获得的投票众数赋予权重,再按照权重的由大到小排序,获取全部的标签权重值;Obtain the majority of votes, assign weights to the obtained majority of votes, and then sort according to the weights from large to small to obtain all tag weight values;

保留预设阈值内的标签,形成新的标签属性集用于用户社交网络中属性的画像。The tags within the preset threshold are retained to form a new tag attribute set for the portrait of attributes in the user's social network.

作为一种实施方式,还包括以下步骤:As an implementation mode, the following steps are also included:

设定最低检测终止阈值,当相似度小于最低检测终止阈值时,终止该集合的相似度检测。Set the minimum detection termination threshold, when the similarity is less than the minimum detection termination threshold, the similarity detection of this set is terminated.

作为一种实施方式,所述最低检测终止阈值为0.15。As an implementation manner, the minimum detection termination threshold is 0.15.

作为一种实施方式,所述相似度函数为:As an implementation manner, the similarity function is:

其中,α为相似度调节参数,α∈[0,1],ω(x)代表标签相似度较高的两种属性函数。Among them, α is the similarity adjustment parameter, α∈[0,1], ω(x) represents two attribute functions with higher label similarity.

作为一种实施方式,所述α取值为0.001。As an implementation manner, the value of α is 0.001.

作为一种实施方式,所述保留预设阈值内的标签,形成新的标签属性集用于用户社交网络中属性的画像步骤,具体包括以下步骤:As an implementation, the step of retaining tags within the preset threshold and forming a new tag attribute set for the portrait of attributes in the user's social network specifically includes the following steps:

设定标签众数阈值,当随机森林算法获取的投票众数小于标签众数时,则认为该标签不具代表性,舍弃该标签;Set the label mode threshold. When the vote mode obtained by the random forest algorithm is less than the label mode, the label is considered unrepresentative and the label is discarded;

将保留后的标签根据标签权重值由大到小排序,形成新的标签属性集。The reserved tags are sorted according to the tag weight value from large to small to form a new tag attribute set.

作为一种实施方式,所述相似度阈值范围为[0.9,1]。As an implementation manner, the range of the similarity threshold is [0.9, 1].

本发明相比于现有技术的有益效果在于:采用随机森林模型,用于用户的属性标签划分,有效改进了传统的基于小样本抽样划分属性的不足和复杂度的问题。Compared with the prior art, the present invention has the beneficial effects of adopting the random forest model for user's attribute label division, effectively improving the problem of deficiency and complexity of the traditional attribute division based on small sample sampling.

附图说明Description of drawings

图1为本发明的基于随机森林的社交网络用户画像方法的流程图。FIG. 1 is a flow chart of the random forest-based social network user portrait method of the present invention.

具体实施方式detailed description

以下结合附图,对本发明上述的和另外的技术特征和优点进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明的部分实施例,而不是全部实施例。The above and other technical features and advantages of the present invention will be clearly and completely described below in conjunction with the accompanying drawings. Apparently, the described embodiments are only some of the embodiments of the present invention, not all of them.

如图所示,一种基于随机森林的社交网络用户画像方法,具体包括如下步骤:As shown in the figure, a random forest-based social network user portrait method specifically includes the following steps:

S100:获取在线社交网站的多源属性数据,将其导入数据存储系统;S100: Obtain multi-source attribute data of online social networking sites, and import them into a data storage system;

S101:将原始多源属性的数据属性集合进行原始属性标号,调用相似度函数对不同属性的集合遍历相似检测,相似度函数为:S101: Label the data attribute set of the original multi-source attribute with the original attribute, and call the similarity function to traverse the similarity detection for the collection of different attributes. The similarity function is:

其中,其中,α为相似度调节参数,α∈[0,1],ω(x)代表标签相似度较高的两种属性函数。但是实际中α取值一般非常小,依赖于样本的测试取值不断修正,根据实验结果表明当α提高一个数量级时,选择的特征非常少,而当α降低一个数量级时获取的数值几乎不变,因此,本实施例中α使用0.001;Among them, α is the similarity adjustment parameter, α∈[0,1], ω(x) represents two attribute functions with higher label similarity. However, in practice, the value of α is generally very small, and the test value dependent on the sample is constantly revised. According to the experimental results, when α is increased by an order of magnitude, the selected features are very few, and when α is reduced by an order of magnitude, the obtained value is almost unchanged. , therefore, in this embodiment, α uses 0.001;

S102:设定最低检测终止阈值,当相似度小于最低检测终止阈值时,终止该集合的相似度检测,其中,最低检测终止阈值为0.15;S102: Set the lowest detection termination threshold, when the similarity is less than the lowest detection termination threshold, terminate the similarity detection of the set, wherein the minimum detection termination threshold is 0.15;

S103:根据原始单层多源属性的决策树,将相似度满足阈值范围的集合合并生成合并属性标签后,采用随机森林算法训练样本,相似度阈值范围为[0.9,1];S103: According to the decision tree of the original single-layer multi-source attribute, after merging the sets whose similarity meets the threshold range to generate the merged attribute label, use the random forest algorithm to train samples, and the similarity threshold range is [0.9,1];

S104:获取投票众数,将获得的投票众数赋予权重,再按照权重的由大到小排序,获取全部的标签权重值;S104: Obtain the majority of votes, assign weights to the obtained majority of votes, and then sort according to the weights from large to small to obtain all tag weight values;

S105:保留预设阈值内的标签,形成新的标签属性集用于用户社交网络中属性的画像,具体实施方式为:设定标签众数阈值,当随机森林算法获取的投票众数小于标签众数阈值时,则认为该标签不具代表性,舍弃该标签;将保留后的标签根据标签权重值由大到小排序,形成新的标签属性集,新的标签属性集用于社交网络的用户画像。S105: Keep the tags within the preset threshold, and form a new tag attribute set for the portrait of attributes in the user's social network. The specific implementation method is: set the tag mode threshold, when the voting mode obtained by the random forest algorithm is less than the When the threshold is higher than the threshold, the label is considered unrepresentative and discarded; the reserved labels are sorted according to the label weight value from large to small to form a new label attribute set, and the new label attribute set is used for user portraits in social networks .

本发明相比于现有技术的有益效果在于:采用随机森林模型,用于用户的属性标签划分,有效改进了传统的基于小样本抽样划分属性的不足和复杂度的问题。Compared with the prior art, the present invention has the beneficial effects of adopting the random forest model for user's attribute label division, effectively improving the problem of deficiency and complexity of the traditional attribute division based on small sample sampling.

以上所述的具体实施例,对本发明的目的、技术方案和有益效果进行了进一步的详细说明,应当理解,以上所述仅为本发明的具体实施例而已,并不用于限定本发明的保护范围。特别指出,对于本领域技术人员来说,凡在本发明的精神和原则之内,所做的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。The specific embodiments described above have further described the purpose, technical solutions and beneficial effects of the present invention in detail. It should be understood that the above descriptions are only specific embodiments of the present invention and are not intended to limit the protection scope of the present invention. . In particular, for those skilled in the art, any modifications, equivalent replacements, improvements, etc. made within the spirit and principles of the present invention shall be included within the protection scope of the present invention.

Claims (7)

1.一种基于随机森林的社交网络用户画像方法,其特征在于,具体包括如下步骤:1. A social network user portrait method based on random forest, is characterized in that, specifically comprises the steps: 获取在线社交网站的多源属性数据;Obtain multi-source attribute data for online social networking sites; 将原始多源属性的数据属性集合进行原始属性标号,调用相似度函数对不同属性的数据属性集合遍历相似检测;Label the data attribute collection of the original multi-source attribute with the original attribute label, and call the similarity function to traverse the similarity detection for the data attribute collection of different attributes; 根据原始单层多源属性的决策树,将相似度满足阈值范围的数据属性集合合并生成合并属性标签后,采用随机森林算法训练样本;According to the decision tree of the original single-layer multi-source attributes, the data attribute sets whose similarity meets the threshold range are merged to generate the merged attribute labels, and the random forest algorithm is used to train the samples; 获取投票众数,将获得的投票众数赋予权重,再按照权重的由大到小排序,获取全部的标签权重值;Obtain the majority of votes, assign weights to the obtained majority of votes, and then sort according to the weights from large to small to obtain all tag weight values; 保留预设阈值内的标签,形成新的标签属性集用于用户社交网络中属性的画像。The tags within the preset threshold are retained to form a new tag attribute set for the portrait of attributes in the user's social network. 2.根据权利要求1所述的基于随机森林的社交网络用户画像方法,其特征在于,还包括以下步骤:2. the social network user portrait method based on random forest according to claim 1, is characterized in that, also comprises the following steps: 设定最低检测终止阈值,当相似度小于最低检测终止阈值时,终止该集合的相似度检测。Set the minimum detection termination threshold, when the similarity is less than the minimum detection termination threshold, the similarity detection of this set is terminated. 3.根据权利要求2所述的基于随机森林的社交网络用户画像方法,其特征在于,所述最低检测终止阈值为0.15。3. The random forest-based social network user portrait method according to claim 2, wherein the minimum detection termination threshold is 0.15. 4.根据权利要求1所述的基于随机森林的社交网络用户画像方法,其特征在于,所述相似度函数为:4. the social network user portrait method based on random forest according to claim 1, is characterized in that, described similarity function is: 其中,α为相似度调节参数,α∈0,1,ω(x)代表标签相似度较高的两种属性函数。Among them, α is the similarity adjustment parameter, α∈0,1, ω(x) represents two attribute functions with higher label similarity. 5.根据权利要求4所述的基于随机森林的社交网络用户画像方法,其特征在于,所述α取值为0.001。5. The random forest-based social network user portrait method according to claim 4, wherein the value of α is 0.001. 6.根据权利要求1所述的基于随机森林的社交网络用户画像方法,其特征在于,所述保留预设阈值内的标签,形成新的标签属性集用于用户社交网络中属性的画像步骤,具体包括以下步骤:6. The social network user portrait method based on random forest according to claim 1, characterized in that, the label within the preset threshold is retained to form a new label attribute set for the portrait step of attributes in the user social network, Specifically include the following steps: 设定标签众数阈值,当随机森林算法获取的投票众数小于标签众数时,则认为该标签不具代表性,舍弃该标签;Set the label mode threshold. When the vote mode obtained by the random forest algorithm is less than the label mode, the label is considered unrepresentative and the label is discarded; 将保留后的标签根据标签权重值由大到小排序,形成新的标签属性集。The reserved tags are sorted according to the tag weight value from large to small to form a new tag attribute set. 7.根据权利要求1所述的基于随机森林的社交网络用户画像方法,其特征在于,所述相似度阈值范围为[0.9,1]。7. The random forest-based social network user portrait method according to claim 1, wherein the similarity threshold range is [0.9,1].
CN201710038836.4A 2017-01-19 2017-01-19 A method of social network user portrait based on random forest Active CN106875278B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710038836.4A CN106875278B (en) 2017-01-19 2017-01-19 A method of social network user portrait based on random forest

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710038836.4A CN106875278B (en) 2017-01-19 2017-01-19 A method of social network user portrait based on random forest

Publications (2)

Publication Number Publication Date
CN106875278A true CN106875278A (en) 2017-06-20
CN106875278B CN106875278B (en) 2020-11-03

Family

ID=59157771

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710038836.4A Active CN106875278B (en) 2017-01-19 2017-01-19 A method of social network user portrait based on random forest

Country Status (1)

Country Link
CN (1) CN106875278B (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108596444A (en) * 2018-04-02 2018-09-28 清华大学 The method and device of large scale community network user sampling based on diversification strategy
CN108876470A (en) * 2018-06-29 2018-11-23 腾讯科技(深圳)有限公司 Tagging user extended method, computer equipment and storage medium
CN109635190A (en) * 2018-11-28 2019-04-16 四川亨通网智科技有限公司 User characteristics method for digging based on position and behavior Conjoint Analysis
CN109785034A (en) * 2018-11-13 2019-05-21 北京码牛科技有限公司 User's portrait generation method, device, electronic equipment and computer-readable medium
CN110659921A (en) * 2018-06-28 2020-01-07 上海传漾广告有限公司 Method and system for analyzing correlation between network advertisement audience behaviors and audience interests
CN112307831A (en) * 2019-07-31 2021-02-02 广州弘度信息科技有限公司 Violent movement detection method based on human body key point detection and tracking
CN113076476A (en) * 2021-04-01 2021-07-06 重庆邮电大学 User portrait construction method of microblog heterogeneous information
CN115099344A (en) * 2022-06-28 2022-09-23 中国平安人寿保险股份有限公司 Model training method and device, user portrait generation method and device, and equipment

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103678659A (en) * 2013-12-24 2014-03-26 焦点科技股份有限公司 E-commerce website cheat user identification method and system based on random forest algorithm
CN105824912A (en) * 2016-03-15 2016-08-03 平安科技(深圳)有限公司 Personalized recommending method and device based on user portrait
CN105868773A (en) * 2016-03-23 2016-08-17 华南理工大学 Hierarchical random forest based multi-tag classification method
US20160328837A1 (en) * 2015-05-08 2016-11-10 Kla-Tencor Corporation Method and System for Defect Classification

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103678659A (en) * 2013-12-24 2014-03-26 焦点科技股份有限公司 E-commerce website cheat user identification method and system based on random forest algorithm
US20160328837A1 (en) * 2015-05-08 2016-11-10 Kla-Tencor Corporation Method and System for Defect Classification
CN105824912A (en) * 2016-03-15 2016-08-03 平安科技(深圳)有限公司 Personalized recommending method and device based on user portrait
CN105868773A (en) * 2016-03-23 2016-08-17 华南理工大学 Hierarchical random forest based multi-tag classification method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
FENG LIU 等: "MLRF:Multi-label Classification Through Random Forest with Label-Set Partition", 《ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS(ICIC 2015)》 *
刘勘 等: "基于随机森林分类的微博机器用户识别研究", 《北京大学学报(自然科学版)》 *

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108596444A (en) * 2018-04-02 2018-09-28 清华大学 The method and device of large scale community network user sampling based on diversification strategy
CN108596444B (en) * 2018-04-02 2021-06-29 清华大学 Method and device for large-scale social network user sampling based on diversification strategy
CN110659921A (en) * 2018-06-28 2020-01-07 上海传漾广告有限公司 Method and system for analyzing correlation between network advertisement audience behaviors and audience interests
CN108876470A (en) * 2018-06-29 2018-11-23 腾讯科技(深圳)有限公司 Tagging user extended method, computer equipment and storage medium
CN109785034A (en) * 2018-11-13 2019-05-21 北京码牛科技有限公司 User's portrait generation method, device, electronic equipment and computer-readable medium
CN109635190A (en) * 2018-11-28 2019-04-16 四川亨通网智科技有限公司 User characteristics method for digging based on position and behavior Conjoint Analysis
CN112307831A (en) * 2019-07-31 2021-02-02 广州弘度信息科技有限公司 Violent movement detection method based on human body key point detection and tracking
CN112307831B (en) * 2019-07-31 2023-04-14 广州弘度信息科技有限公司 Violent movement detection method based on human body key point detection and tracking
CN113076476A (en) * 2021-04-01 2021-07-06 重庆邮电大学 User portrait construction method of microblog heterogeneous information
CN113076476B (en) * 2021-04-01 2021-11-30 重庆邮电大学 User portrait construction method of microblog heterogeneous information
CN115099344A (en) * 2022-06-28 2022-09-23 中国平安人寿保险股份有限公司 Model training method and device, user portrait generation method and device, and equipment

Also Published As

Publication number Publication date
CN106875278B (en) 2020-11-03

Similar Documents

Publication Publication Date Title
CN106875278B (en) A method of social network user portrait based on random forest
CN113518063B (en) Network intrusion detection method and system based on data enhancement and BilSTM
CN112818023B (en) Big data analysis method and cloud computing server in associated cloud service scene
WO2018032937A1 (en) Method and apparatus for classifying text information
CN113326377B (en) Name disambiguation method and system based on enterprise association relationship
CN108259637A (en) A kind of NAT device recognition methods and device based on decision tree
CN104572736A (en) Keyword extraction method and device based on social networking services
CN104731811B (en) A kind of clustering information evolution analysis method towards extensive dynamic short text
Song Sentiment analysis of Japanese text and vocabulary learning based on natural language processing and SVM
CN114970680A (en) CNN + LSTM-based flow terminal real-time identification method and device
CN115438274A (en) False news identification method based on heterogeneous graph convolutional network
CN107145568A (en) A kind of quick media event clustering system and method
CN105302823A (en) Overlapped community parallel discovery method and system
CN112884121A (en) Traffic identification method based on generation of confrontation deep convolutional network
CN105468780A (en) Normalization method and device of product name entity in microblog text
CN114937311A (en) Time sequence action detection method based on accurate boundary prediction
CN114491168A (en) Method and system for regulating and controlling cloud sample data sharing, computer equipment and storage medium
CN118690260A (en) A big data information processing method, system, medium and server based on deep learning
CN113886547B (en) Client real-time dialogue switching method and device based on artificial intelligence and electronic equipment
CN115630357B (en) A method for determining the behavior of applications that cross-border collection of personal information
CN110502880A (en) A Heterogeneous Identity Association Method Based on Attribute Aggregation
CN103986606A (en) A Method of Parallel Recognition and Statistics of Web Page URLs Based on MapReduce Algorithm
CN113076741A (en) Multilingual text data analysis method
CN110765236A (en) Preprocessing method and system for unstructured mass data
CN114610968B (en) Blog user dividing method based on improved graph rolling network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant