CN111859165A - Real-time personalized information flow recommendation method based on user behaviors - Google Patents
Real-time personalized information flow recommendation method based on user behaviors Download PDFInfo
- Publication number
- CN111859165A CN111859165A CN202010558811.9A CN202010558811A CN111859165A CN 111859165 A CN111859165 A CN 111859165A CN 202010558811 A CN202010558811 A CN 202010558811A CN 111859165 A CN111859165 A CN 111859165A
- Authority
- CN
- China
- Prior art keywords
- user
- content
- behavior
- behaviors
- negative
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 230000006399 behavior Effects 0.000 title claims abstract description 85
- 238000000034 method Methods 0.000 title claims abstract description 21
- 238000012163 sequencing technique Methods 0.000 claims abstract description 4
- 238000012545 processing Methods 0.000 claims abstract description 3
- 230000008092 positive effect Effects 0.000 claims abstract 2
- 239000013598 vector Substances 0.000 claims description 14
- 230000008451 emotion Effects 0.000 claims description 8
- 238000012549 training Methods 0.000 claims description 7
- 238000013145 classification model Methods 0.000 claims description 6
- 238000001914 filtration Methods 0.000 claims description 4
- 238000007781 pre-processing Methods 0.000 claims description 2
- 230000011218 segmentation Effects 0.000 claims description 2
- 238000010276 construction Methods 0.000 claims 1
- 230000000694 effects Effects 0.000 abstract 1
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000013480 data collection Methods 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 239000013585 weight reducing agent Substances 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9536—Search customisation based on social or collaborative filtering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
- G06F40/211—Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/049—Temporal neural networks, e.g. delay elements, oscillating neurons or pulsed inputs
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Biomedical Technology (AREA)
- Molecular Biology (AREA)
- Computing Systems (AREA)
- Evolutionary Computation (AREA)
- Biophysics (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- Databases & Information Systems (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention discloses a real-time personalized information flow recommendation method based on user behaviors, which is characterized by comprising the following steps of: collecting user behavior for the content; classifying the user behavior into a positive behavior and a negative behavior; acquiring content similar to the content clicked by the user, and sequencing according to the similarity to obtain a preliminary recommendation result; and performing weighting processing on the content sequence in the preliminary recommendation result based on the positive action and the negative action to obtain a final recommendation result. The method and the device can improve the recommendation effect and improve the user experience.
Description
Technical Field
The invention belongs to the field of intelligent recommendation, and particularly relates to a real-time personalized information flow recommendation method based on user behaviors.
Background
With the development of the internet and the popularization of smart phones, people have moved into the electronic reading era from paper reading, people more and more utilize smart phones to obtain various information from various large information flow software through the internet, and users can click, subscribe, share, comment and the like on the software to generate a large amount of user behaviors. And each piece of large software also utilizes the behaviors to perform personalized recommendation on the user.
When users interact with the information flow software, many behaviors such as praise, share, like, comment and the like can be generated. These behaviors can be classified into positive behaviors and negative behaviors from the emotion of the user. The behaviors that can reflect that the user likes the content are forward behaviors, and some behaviors are analyzed to reflect that the user does not like the content. For example, the comments of the user, which correspond to the subjective feeling of the user, are positive, positive comments are positive behavior, and negative comments are negative, which indicate that the user dislikes or even dislikes the content. Such content should therefore no longer be recommended to the user.
However, most software and mainstream information flow recommendation algorithms only select a small part of the behaviors or consider all user behaviors as forward behaviors, so that many recommended contents are generated, and the user does not like the forward behaviors, thereby influencing the user experience.
Disclosure of Invention
Aiming at the problems in the prior art, the invention provides a real-time personalized information flow recommendation method based on user behaviors.
In order to achieve the purpose, the invention adopts the following technical scheme:
a real-time personalized information flow recommendation method based on user behaviors comprises the following steps: collecting user behavior for the content; classifying the user behavior into positive and negative behavior; acquiring content similar to the content clicked by the user, and sequencing according to the similarity to obtain a preliminary recommendation result; and performing weighting processing on the content sequence in the preliminary recommendation result based on the positive behavior and the negative behavior to obtain a final recommendation result.
Preferably, the user behaviors comprise comments, and the comments are classified into positive behaviors and negative behaviors based on a comment emotion classification model built by using Word2vec Word vectors and a long-short term memory model.
Preferably, the method for building the comment emotion classification model comprises the following steps: preprocessing the comments into an ordered combination of a plurality of words through a word dividing tool jieba; putting the ordered combination into a Word2vec model, and training Word vectors by adopting a skip-gram method; accumulating word vectors of words contained in the comments to obtain sentence vectors of the comments; the vectorized comments and the pre-labeled results are placed in an LSTM model for supervised training.
Preferably, the user behavior comprises a click, and classifying the click into a positive behavior and a negative behavior comprises: based on the time that the user stays on the clicked content page after clicking, wherein the time is larger than or equal to a preset value, and the clicking is classified as forward behavior; the time is less than a predetermined value and the click is classified as negative-going behavior.
Preferably, the acquiring content similar to the content clicked by the user includes: and calculating the similarity between the contents by adopting a collaborative filtering algorithm based on the articles.
Preferably, the similarity of the contents is judged by using the Jaccard coefficient, and the larger the coefficient value is, the more similar is shown, and the formula of the Jaccard coefficient is as follows:
where n (i) represents a set of users who like content i, and n (j) represents a set of users who like content j.
An electronic device, comprising: a processor; and the memory is provided with executable instructions which can be executed by the processor, and the executable instructions realize the recommendation method after being executed.
A computer readable storage medium storing computer instructions which, when executed by a processor, implement the steps of the recommendation method.
A real-time personalized information flow recommendation system based on user behavior, the recommendation system comprising: a collection module that collects user behavior for content; a classification module that classifies the user behavior as positive and negative behavior; the recall module acquires contents similar to contents clicked by a user and carries out sequencing according to the similarity to obtain a preliminary recommendation result; and
and the weighting module is used for weighting the content sequence in the recommendation pool based on the positive behavior and the negative behavior to obtain a final recommendation result.
Compared with the prior art, the invention has the beneficial effects that:
1. the information hidden behind the user behavior can be fully mined, and the recommendation quality is obviously improved;
2. the recommendation result is more intelligent, and the user experience is obviously improved;
3. and a part of inferior content can be found from negative-going behaviors, so that the platform is helped to control the content quality.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to these drawings without creative efforts.
FIG. 1 is a schematic flow chart of an embodiment of the present invention.
FIG. 2 is a schematic diagram of an LSTM model according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all embodiments. All other embodiments, which can be obtained by a person skilled in the art without inventive effort based on the embodiments of the present invention, are within the scope of the present invention.
As shown in fig. 1-2, the present embodiment provides the following technical solutions:
1. collecting the behaviors of the user:
and collecting and storing behaviors generated by users in the software through a data collection module, wherein the behaviors comprise clicks, comments, praise, likes, subscriptions and the like. It is also necessary to record the time the user clicks on each content and the content of the comments. And taking out part of comments of the users, and manually marking the comments of the users according to different emotions, wherein the comments are divided into positive comments and negative comments.
2. Classifying the user's behavior:
most user behaviors can be directly relegated to forward behaviors, such as praise, like, subscribe, and the like. Some user behaviors can also be directly relegated to negative behaviors, such as dislike, deletion, etc. But for click and comment behavior, certain analysis is needed before classification.
3. Classifying click behaviors:
the time length of the user staying on the content after clicking is calculated according to the time of clicking of the user. If the dwell time is short, meaning that the user may be just attracted by the title, and not interested in the content, then negative behavior is considered. If the dwell time is longer, it indicates that the user is likely to read the entire content, and the recommended content is of great interest, it may be considered a forward behavior.
4. Classifying the comment behavior:
for review behavior: and (3) building a comment emotion classification model by using the Word2vec Word vector and a long-short term memory model (long-short term) and classifying the comments of the user into positive comments and negative comments, wherein the positive comments correspond to positive behaviors, and the negative comments correspond to negative behaviors.
All comments of a user are preprocessed, and by utilizing an open-source word segmentation tool jieba, all comments are segmented and meaningless single words such as ' and ' ground ' are removed, namely each comment is changed into an ordered combination of a plurality of words. Putting the processed comments into a Word2vec model, and training Word vectors by adopting a skip-gram method, wherein W represents a whole vocabulary, wiFor a given word, wjIs a contextual word.Andis corresponding to the target word wiE.w and its surrounding potential vector representations of context words.
Skip-gram formula in Word2 vec:
after training, each word can be represented by a high-dimensional vector. And accumulating word vectors of words contained in the comments to obtain a sentence vector of the comment.
And putting the vectorized comments and the marked results into an LSTM model for supervised training. An emotion classification model suitable for the user comments in the scene can be obtained. I.e. a comment is entered, whether the comment belongs to a positive comment or a negative comment can be obtained.
LSTM model formula:
zt=σ(Wz·[ht-1,xt])
rt=σ(Wr·[ht-1,xt])
wherein,xtfor the current input, ht-1For the last node to pass the status of, Wz、WrAnd W are weights.
5. Recalling content
The recalling is to find the content which is most similar to the content clicked by the user, and sort according to the similarity, wherein the more similar, the higher the sort. Here, an Item-based collaborative filtering (Item-based collaborative filtering) algorithm is used to calculate the similarity between content and content. Similarity is judged by using the Jaccard coefficient, and the larger the coefficient value is, the more similar the coefficient value is.
The formula for the Jaccard coefficient is:
where n (i) represents a set of users who like content i, and n (j) represents a set of users who like content j.
6. Reordering to generate recommended results
And classifying the behaviors of the user, finding the recall results related to the contents in the recall results for the contents with the positive behaviors of the user, and weighting the results to make the results ranked higher. For the content with the negative behavior of the user, recall results relevant to the content are found in the recall results, and the results are subjected to weight reduction or deleted from the recall results. And after the behaviors of the user are analyzed, generating a final recommendation result in real time.
Although the present invention has been described in detail with respect to the above embodiments, it will be understood by those skilled in the art that modifications or improvements based on the disclosure of the present invention may be made without departing from the spirit and scope of the invention, and these modifications and improvements are within the spirit and scope of the invention.
Claims (9)
1. A real-time personalized information flow recommendation method based on user behaviors is characterized by comprising the following steps:
collecting user behavior for the content;
Classifying the user behavior into a positive behavior and a negative behavior;
acquiring content similar to the content clicked by the user, and sequencing according to the similarity to obtain a preliminary recommendation result;
and performing weighting processing on the content sequence in the preliminary recommendation result based on the positive action and the negative action to obtain a final recommendation result.
2. The real-time personalized information flow recommendation method based on user behaviors as claimed in claim 1, wherein the user behaviors include comments, and the classification of the comments into positive behaviors and negative behaviors is based on a comment emotion classification model built by using Word2vec Word vectors and a long-short term memory model.
3. The real-time personalized information flow recommendation method based on user behaviors as claimed in claim 2, wherein the construction method of the comment emotion classification model comprises the following steps:
preprocessing the comments into an ordered combination of a plurality of words through a word segmentation tool jieba;
putting the ordered combination into a Word2vec model, and training Word vectors by adopting a skip-gram method;
accumulating word vectors of words contained in the comments to obtain sentence vectors of the comments;
the vectorized comments and the pre-labeled results are placed in an LSTM model for supervised training.
4. The method of claim 1, wherein the user behavior comprises clicks, and classifying the clicks as positive and negative behaviors comprises:
based on the time that the user stays on the clicked content page after clicking, wherein the time is more than or equal to a preset value, and the clicking is classified into forward behaviors; the time is less than a predetermined value and the click is classified as negative-going behavior.
5. The method according to claim 1, wherein the obtaining content similar to the content clicked by the user comprises:
and calculating the similarity between the contents by adopting a collaborative filtering algorithm based on the articles.
6. The method of claim 5, wherein the similarity of the content is determined by using the Jaccard coefficient, and the larger the value of the coefficient, the more similar the similarity is indicated, and the formula of the Jaccard coefficient is:
where n (i) represents a set of users who like content i, and n (j) represents a set of users who like content j.
7. An electronic device, comprising:
A processor;
a memory, wherein executable instructions are provided in the memory for execution by the processor, and when executed, the executable instructions implement the recommended method of any one of claims 1-6.
8. A computer-readable storage medium storing computer instructions for implementing the steps of the recommendation method according to any one of claims 1-6 when executed by a processor.
9. A real-time personalized information flow recommendation system based on user behavior, the recommendation system comprising:
a collection module that collects user behavior for content;
a classification module that classifies the user behavior as positive and negative behavior;
the recall module acquires contents similar to contents clicked by a user and sorts the contents according to the similarity to obtain a primary recommendation result; and
and the weighting module is used for weighting the content sequence in the recommendation pool based on the positive behavior and the negative behavior to obtain a final recommendation result.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010558811.9A CN111859165A (en) | 2020-06-18 | 2020-06-18 | Real-time personalized information flow recommendation method based on user behaviors |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010558811.9A CN111859165A (en) | 2020-06-18 | 2020-06-18 | Real-time personalized information flow recommendation method based on user behaviors |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111859165A true CN111859165A (en) | 2020-10-30 |
Family
ID=72986233
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010558811.9A Pending CN111859165A (en) | 2020-06-18 | 2020-06-18 | Real-time personalized information flow recommendation method based on user behaviors |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111859165A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114416503A (en) * | 2021-12-31 | 2022-04-29 | 北京五八信息技术有限公司 | Data processing method and device, electronic equipment and storage medium |
CN117709914A (en) * | 2024-02-05 | 2024-03-15 | 天津徙木科技有限公司 | Post matching method and system |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108573411A (en) * | 2018-04-17 | 2018-09-25 | 重庆理工大学 | Depth sentiment analysis and multi-source based on user comment recommend the mixing of view fusion to recommend method |
-
2020
- 2020-06-18 CN CN202010558811.9A patent/CN111859165A/en active Pending
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108573411A (en) * | 2018-04-17 | 2018-09-25 | 重庆理工大学 | Depth sentiment analysis and multi-source based on user comment recommend the mixing of view fusion to recommend method |
Non-Patent Citations (2)
Title |
---|
张宜浩 朱小飞等: ""基于用户评论的深度情感分析和多视图协同融合的混合推荐方法"", 《计算机学报》, vol. 42, no. 6, pages 1316 - 1333 * |
邓宁,牛宇: "《旅游大数据》", 31 August 2019, 旅游教育出版社, pages: 124 * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114416503A (en) * | 2021-12-31 | 2022-04-29 | 北京五八信息技术有限公司 | Data processing method and device, electronic equipment and storage medium |
CN117709914A (en) * | 2024-02-05 | 2024-03-15 | 天津徙木科技有限公司 | Post matching method and system |
CN117709914B (en) * | 2024-02-05 | 2024-05-10 | 台州徙木数字服务有限公司 | Post matching method and system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111444428B (en) | Information recommendation method and device based on artificial intelligence, electronic equipment and storage medium | |
CN108829822B (en) | Media content recommendation method and device, storage medium and electronic device | |
CN108563722B (en) | Industry classification method, system, computer device and storage medium for text information | |
CN111767403B (en) | Text classification method and device | |
CN112131350B (en) | Text label determining method, device, terminal and readable storage medium | |
CN106599022B (en) | User portrait forming method based on user access data | |
CN111008278A (en) | Content recommendation method and device | |
CN109918539B (en) | Audio and video mutual retrieval method based on user click behavior | |
CN113254637B (en) | Grammar-fused aspect-level text emotion classification method and system | |
CN105279495A (en) | Video description method based on deep learning and text summarization | |
CN109086265B (en) | Semantic training method and multi-semantic word disambiguation method in short text | |
CN111368075A (en) | Article quality prediction method and device, electronic equipment and storage medium | |
CN111859149A (en) | Information recommendation method and device, electronic equipment and storage medium | |
CN111382361A (en) | Information pushing method and device, storage medium and computer equipment | |
CN111353044B (en) | Comment-based emotion analysis method and system | |
CN112633690B (en) | Service personnel information distribution method, device, computer equipment and storage medium | |
CN109960791A (en) | Judge the method and storage medium, terminal of text emotion | |
CN117972206A (en) | Content recommendation system, electronic equipment and storage medium based on artificial intelligence | |
CN111859165A (en) | Real-time personalized information flow recommendation method based on user behaviors | |
CN114416969B (en) | LSTM-CNN online comment emotion classification method and system based on background enhancement | |
CN115186085A (en) | Reply content processing method and interaction method of media content interaction content | |
CN112163415A (en) | User intention identification method and device for feedback content and electronic equipment | |
CN113407776A (en) | Label recommendation method and device, training method and medium of label recommendation model | |
CN111798217B (en) | Data analysis system and method | |
CN114238798A (en) | Search ranking method, system, device and storage medium based on neural network |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |