CN114971741A

CN114971741A - Method and terminal for positioning network literature user loss sections

Info

Publication number: CN114971741A
Application number: CN202210744248.3A
Authority: CN
Inventors: 邹建峰; 陈旺; 陈居晟
Original assignee: Fuzhou Changxin Information Technology Co ltd
Current assignee: Fuzhou Changxin Information Technology Co ltd
Priority date: 2022-06-27
Filing date: 2022-06-27
Publication date: 2022-08-30
Anticipated expiration: 2042-06-27
Also published as: CN114971741B

Abstract

The invention discloses a method and a terminal for positioning network literature user loss chapters, wherein a preset number of chapters are taken as a unit, the unit user loss rate of each unit of a designated book is calculated, and the average unit user loss rate is calculated; judging whether the unit user churn rate of a first unit is larger than the average unit user churn rate of a preset multiplying power or not, and if so, positioning each chapter in the first unit; acquiring and analyzing book review information of each chapter in the first unit to obtain all chapters with the bad review number exceeding a preset threshold value as problem content chapters; through calculation of the user loss rate and analysis of user evaluation, problem content chapters causing a large amount of loss of the user can be effectively and accurately positioned, so that the content of the chapters can be adjusted subsequently or corresponding processing can be carried out to improve the content quality of the novel, and the purpose of improving user retention is achieved.

Description

Method and terminal for positioning network literature user loss sections

Technical Field

The invention relates to the technical field of electronic reading, in particular to a method and a terminal for positioning lost chapters of a network literature user.

Background

Chinese has 5 hundred million network users, and the network novels are recorded in millions, and have different advantages and disadvantages. The influence of the content quality of the novel in the operation process of the network literature novel is particularly important. Through a large amount of data analysis, we can conclude that, among the abnormal user loss reasons, the user loss caused by the novel content problem accounts for more than 80% of the analysis data, and other various reasons account for 20% in total. Therefore, in order to improve the retention rate of the user and reduce the loss of the user, the user needs to accurately find out the problem content chapters, correct and adjust the chapter content or perform corresponding processing to improve the content quality of the novel, so as to achieve the purpose of improving the retention rate of the user.

However, how to locate the problem content section is an urgent problem to be solved at present.

Disclosure of Invention

The technical problem to be solved by the invention is as follows: the method and the terminal for positioning the network literature user loss chapters can accurately position the problem content chapters causing user loss.

In order to solve the technical problems, the invention adopts the technical scheme that:

a method for positioning network literature user loss sections comprises the following steps:

s1, calculating the unit user attrition rate of each unit of the appointed book by taking a preset number of chapters as a unit, and calculating the average unit user attrition rate;

s2, judging whether the unit user churn rate of a first unit is larger than the average unit user churn rate of a preset multiplying power, and if so, positioning each chapter in the first unit;

and S3, acquiring and analyzing the book comment information of each chapter in the first unit, and obtaining all chapters with the bad comment quantity exceeding a preset threshold value as question content chapters.

In order to solve the technical problem, the invention adopts another technical scheme as follows:

a terminal for locating network literature user churn chapters, comprising a processor, a memory, and a computer program stored in the memory and executable on the processor, when executing the computer program, implementing the steps of:

The invention has the beneficial effects that: according to the method, the electronic book calculates the user loss rate by taking the preset number of chapters as a unit, so that a range of the problem chapters causing the user loss is determined, and the chapters with poor user evaluation, namely the problem content chapters causing the user loss, are further screened out according to the user evaluation condition of the chapters in the range, so that the problem content chapters causing the user loss can be effectively and accurately positioned, the content of the chapters can be adjusted or correspondingly processed subsequently, the content quality of the novel can be improved, and the purpose of improving the user retention can be achieved.

Drawings

Fig. 1 is a flowchart of a method for locating network literature user churn sections according to an embodiment of the present invention;

fig. 2 is a structural diagram of a terminal for positioning network literature user churn chapters according to an embodiment of the present invention;

fig. 3 is a detailed flowchart of a method for locating network literature user churn sections according to an embodiment of the present invention;

description of reference numerals:

1. a terminal for positioning network literature user loss chapters; 2. a processor; 3. a memory.

Detailed Description

In order to explain technical contents, achieved objects, and effects of the present invention in detail, the following description is made with reference to the accompanying drawings in combination with the embodiments.

Referring to fig. 1, a method for locating network literature user churn sections includes the steps:

From the above description, the beneficial effects of the present invention are: according to the method, the electronic book calculates the user loss rate by taking the preset number of chapters as a unit, so that a range of the problem chapters causing the user loss is determined, and the chapters with poor user evaluation, namely the problem content chapters causing the user loss, are further screened out according to the user evaluation condition of the chapters in the range, so that the problem content chapters causing the user loss can be effectively and accurately positioned, the content of the chapters can be adjusted or correspondingly processed subsequently, the content quality of the novel can be improved, and the purpose of improving the user retention can be achieved.

Further, the step S1 is specifically:

s11, calculating the unit user churn rate of each unit by taking a preset number of chapters as a unit aiming at the payment chapters of the appointed book, wherein the calculation of the unit user churn rate specifically comprises the following steps:

rate＝1-[num(n+a)/num(n)]；

wherein a represents a preset number, num (n + a) represents the number of unlocking users of the (n + a) th chapter, num represents the number of unlocking users of the nth chapter, and the first unit comprises the (n + 1) th chapter to the (n + a) th chapter;

and S12, calculating the average value of the unit user loss rate of each unit to obtain the average unit user loss rate.

From the above description, the user loss rate during the period from chapter n +1 to chapter n + a is calculated according to the number of unlocking users in chapter n and the number of unlocking users in chapter n + a, which is direct, reasonable and more accurate.

Further, the step between the step S2 and the step S3 further includes the steps of:

s21, acquiring chapter unlocking information of each lost user in a first unit, and judging whether the lost user is a normal lost user or an abnormal lost user;

judging whether the lost user is a normal lost user or an abnormal lost user specifically comprises the following steps:

judging whether electronic money or reading gift certificates are reserved in the account after the user unlocks the payment section finally, if so, the user is abnormally lost, otherwise, the user is normally lost;

the book rating information in step S3 is the book rating information of all the users who abnormally lose.

From the above description, if there is no user loss caused by remaining electronic money, we consider that the user loss is normal loss, and the user is not very interested in the content of the book, but only in order to consume the electronic money on hand, the part of people is not in the processing range of the scheme, and relatively, the part of lost users who are not used by the remaining electronic money on the account but no longer unlock the subsequent chapters is the object of the analysis research by the scheme, i.e. the abnormal lost user, and we screen the abnormal lost user for the subsequent review analysis of the book, and then determine the chapters of the problem content, thereby reducing the required computing resources and improving the accuracy of positioning the problem content.

Further, the step S3 of obtaining all chapters in which the number of bad comments exceeds the preset threshold as question content chapters specifically includes:

judging whether chapters with the bad comment quantity exceeding a preset threshold exist in the first unit, if so, marking all chapters with the bad comment quantity exceeding the preset threshold as problem content chapters, and otherwise, entering the step S4;

the step S3 is followed by the step of:

s4, acquiring and analyzing reading behavior information of an abnormal loss user, and calculating a first average page turning speed of the abnormal loss user for reading the specified book and a first average operation interval time for reading a next chapter, and a second average page turning speed of each chapter in a first unit for reading and a second average operation interval time for reading the next chapter;

and if the difference value between the second average page turning speed and the first average page turning speed of the chapters exceeds a preset speed threshold value, or the difference value between the second average operation interval time and the first average operation interval time exceeds a preset interval threshold value, marking the chapters as problem content chapters.

As can be seen from the above description, after the analysis is performed according to the good comment and the bad comment of the book comment, the problem content section still cannot be determined, and we can perform the analysis by the user behavior abnormality of the user, thereby identifying the problem content section.

Further, the step S3 is specifically:

s31, acquiring book review information in the first unit, screening the book review information according to a preset keyword library, eliminating invalid book review information in the book review information, and screening out bad reviews in the valid book review information;

and S32, screening all chapters with the number of bad comments exceeding a preset threshold value in the first unit as question content chapters.

As can be seen from the above description, the present invention screens the reviews of the books through the preset keyword library to screen out invalid reviews of the books, such as randomly typed character strings, watered characters, characters without actual meaning or content, etc., leaves valid reviews of the books and screens out bad reviews of the books, and judges whether the chapters are question content chapters according to the number of the bad reviews.

Further, the step between the step S31 and the step S32 further includes the steps of:

s311, judging whether the book review information is valid or bad, and if the book review information is valid or bad, sending the book review information which cannot be identified to manual work for identification.

As can be seen from the above description, for the book comments which cannot be intelligently identified, the book comments can be sent to the manual work for identification.

Further, the step S3 is replaced with:

reading behavior information of an abnormal loss user is obtained and analyzed, and a first average page turning speed of the abnormal loss user for reading the appointed book and a first average operation interval time for reading a next chapter, and a second average page turning speed of each chapter in a first unit for reading and a second average operation interval time for reading the next chapter are calculated;

As can be seen from the above description, after the chapters in the first unit are located, the reading behavior of the user can also be directly analyzed, so as to identify the question content chapters.

Further, the step S3 is replaced with:

and acquiring book review information of each chapter in the first unit, performing semantic analysis on the book review information according to a preset keyword library, and marking the chapter corresponding to the first book review as a question content chapter if the semantics of the first book review fall into a preset semantic set.

As can be seen from the above description, after each chapter in the first unit is located, the question content chapter can also be determined by analyzing the semantics of the book review, such as the book review with specific semantics, e.g., the book review with semantics "x is very annoying and book discard".

Further, the preset number is 10.

As can be seen from the above description, the unit of chapter 10 is neater and easier to count.

Referring to fig. 2, a terminal for locating network literature user churn chapters includes a processor, a memory, and a computer program stored in the memory and executable on the processor, where the processor implements the steps when executing the computer program.

The method and the terminal for positioning the network literature user loss sections are suitable for a network reading industry, and are used for finding a scene that a certain book has a large number of readers lost and needs to be traced.

Referring to fig. 1 and fig. 3, a first embodiment of the present invention is:

the step S1 specifically includes:

s11, calculating a unit user churn rate of each unit by taking a preset number of chapters as a unit for paying chapters of a specific book, where the calculation of the unit user churn rate specifically includes:

rate＝1-[num(n+a)/num(n)]；

In this embodiment, we monitor chapter run-off rate for the explosive book a in the platform. For each chapter 10 of the payment chapter of book a, a unit user churn rate [1- (chapter N +10 unlock people/chapter N unlock people) ] and an average unit user churn rate are calculated.

In this embodiment, the preset number is 10, and in other equivalent embodiments, the preset number may also be other values, which are set according to actual requirements.

in this embodiment, the unit user loss rate of each pay chapter of 10 chapters is compared with the average unit user loss rate, and if the unit user loss rate of a certain chapter is more than 2 times of the average unit user loss rate, an early warning is given to obtain a large loss chapter range.

In this embodiment, in the process of monitoring the average payment rate and the average loss rate of the book a, the program finds that the user has a large loss between chapters 105 and 114 (payment chapters) of the book, so the data abnormal chapter determined by us is chapter 105 and 115 of the book.

The method between the step S2 and the step S3 further comprises the steps of:

In this embodiment, the criterion of the normal loss of the user is that after the user unlocks the last reading chapter, whether the user has remaining money or gift certificates in the account, and if the user has no loss due to remaining electronic money, the user is deemed to be normal loss, which is not very interested in the content of the book, but only in order to consume the electronic money on the hand, and this part of people is not in the processing category of the scheme. In contrast, the part of the lost users who still have the remaining electronic money on the account and do not unlock the subsequent chapters is the object of the scheme to be analyzed and researched, namely, the abnormal lost users.

When the range of the problem content chapter is located (large loss chapter range), due to the fact that the number of the book chapters is large and the number of readers is large, under the condition that the characteristic that a large number of users consume electronic money or gift certificates in a certain unit does not exist, the influence of normal loss users on the range location is relatively small, but when the problem chapter needs to be accurately located in the large loss chapter range, the interference of normal loss should be eliminated. Meanwhile, due to the fact that book review is needed subsequently or user behaviors are possibly analyzed, the calculation resources occupied by the subsequent steps can be effectively reduced by removing users who are lost normally.

In this embodiment, for chapter 10 of chapters 105-115, the system performs data analysis on the user who has read chapter 10, and screens out the user who runs normally by determining whether there is digital currency remaining but no longer paying for unlocking the chapters, and the screened out user who runs abnormally enters the book review analysis link of the next link.

S3, acquiring and analyzing book review information of each chapter in the first unit, and obtaining all chapters with the bad review number exceeding a preset threshold value as question content chapters;

the step S3 specifically includes:

s311, judging whether the book review information is not identified to be effective or bad, and if the book review information is not identified to be effective or bad, sending the book review information which is not identified to a worker for identification;

In the embodiment, the keyword library which is manually collected and continuously supplemented is used for book review screening, invalid book reviews are eliminated, and the intelligent classification of the valid book reviews into good reviews and poor reviews is carried out. And for part of the book reviews which cannot be intelligently identified and divided, manual analysis can be carried out. The method for distinguishing good comments from bad comments is a method for continuously enriching comment libraries through artificial learning by utilizing a keyword library and analyzing and reading the comment contents to classify comments (the comments can be classified by reading according to Taobao goods comments). If some chapters have a large number of bad scores (exceeding a manually set threshold), then those chapters are located as problem content chapters.

Referring to fig. 3, the second embodiment of the present invention is:

a method for locating sections lost by network literature users, which is different from the first embodiment in that, in the step S3, the step of obtaining all sections in which the number of bad comments exceeds a preset threshold is specifically a question content section:

and judging whether chapters with the bad comment quantity exceeding a preset threshold exist in the first unit, if so, marking all chapters with the bad comment quantity exceeding the preset threshold as problem content chapters, and otherwise, entering the step S4.

In this embodiment, if there are no valid chapters or a large number of badly rated chapters within the range of the checked large-loss chapters, the user behavior is determined. By reading an APP or a page program and the like in advance, reading behavior data of a user is counted when the user reads, such as an average page turning speed and an average operation interval time of the user reading a next chapter. We compare the chapter paging speed and the operation interval time within the investigation range. And if the page turning speed of a user in a certain chapter is greater than or less than the average page turning speed of the whole book and exceeds a preset speed threshold, or the time interval between the user entering the next chapter is greater than or less than the average time interval between the user entering the next chapter and the whole book and exceeds a preset interval threshold, locating the chapter as a problem content chapter.

The preset speed threshold and the preset interval threshold may be set as percentages of the average page turning speed or the average operation interval time corresponding to the whole book, or may be specific numerical values. In actual use, a normal numerical range can be specified according to the average page turning speed or the average operation interval time of the whole book, and if the range is exceeded, the chapter is positioned as a question content chapter.

The problem content chapters determined through the flow of the scheme are specific chapters, and after the problem chapter data is received subsequently, the chapter range with more concentrated data is clear actually. If further needs exist, the scheme of the invention also achieves the effect of further reducing the scope of problem sections.

The third embodiment of the invention is as follows:

a method for locating network literature user churn chapters, which is different from the first embodiment in that step S3 is replaced by:

The fourth embodiment of the invention is as follows:

In this embodiment, after the book review information of each chapter in the first unit is acquired, semantic analysis is performed on each book review through the preset keyword library to determine whether any semantic in the preset semantic set is satisfied, and if the satisfied book review exists, the chapter corresponding to the book review is marked as a problem content chapter. The semantics of the preset semantic set include ". x.is annoying and leaves a book", and the like, and the content of the chapter has a negative emotion or indicates the intention to leave a book.

Referring to fig. 2, a fifth embodiment of the present invention is:

a terminal 1 for locating network literature user churn chapters comprises a processor 2, a memory 3 and a computer program stored in the memory 3 and capable of running on the processor 2, wherein the processor 2 realizes the steps in the first embodiment when executing the computer program.

In summary, the method and the terminal for positioning the network literature user loss chapters provided by the present invention determine a range in which the problem chapters causing the user loss exist by calculating the user loss rate through the electronic book with a preset number of chapters as a unit, further screen out chapters with poor user evaluation according to the user evaluation condition of the chapters in the range, and screen out chapters with abnormal behavior according to the user behavior data, that is, the problem content chapters causing the user loss, and can effectively and accurately position the problem content chapters causing the user loss in order to subsequently adjust the chapter content or perform corresponding processing to improve the content quality of the novel, thereby achieving the purpose of improving the user retention. And the identification of normal loss users and abnormal loss users can be effectively realized according to the user account condition, so that the positioning of problem content chapters is more efficient and accurate.

The above description is only an embodiment of the present invention, and not intended to limit the scope of the present invention, and all equivalent changes made by using the contents of the present specification and the drawings, or applied directly or indirectly to the related technical fields, are included in the scope of the present invention.

Claims

1. A method for positioning network literature user loss chapters is characterized by comprising the following steps:

2. The method for locating sections lost by network literature users according to claim 1, wherein the step S1 specifically comprises:

rate＝1-[num(n+a)/num(n)]；

3. The method for locating network literature user churn chapters according to claim 1, wherein the step between the step S2 and the step S3 further comprises the steps of:

4. The method according to claim 3, wherein the step S3 of obtaining all chapters with the number of bad comments exceeding a preset threshold is specifically:

the step S3 is followed by the step of:

5. The method of claim 1, wherein the step S3 includes:

and S32, screening all chapters with the number of bad comments exceeding a preset threshold value in the first unit as problem content chapters.

6. The method of claim 5, wherein between the step S31 and the step S32, the method further comprises the steps of:

7. The method for locating sections lost by network literature users according to claim 1, wherein the step S3 is replaced by:

8. The method for locating network literature user churn chapters according to claim 1, wherein the step S3 is replaced by:

9. The method of claim 1, wherein the predetermined number is 10.

10. A terminal for locating network literature user churn chapters, comprising a processor, a memory and a computer program stored in the memory and operable on the processor, wherein the processor executes the computer program to implement the steps of any one of the methods for locating network literature user churn chapters of claims 1-9.