CN111105261B - User identification method, device, electronic equipment and storage medium - Google Patents
User identification method, device, electronic equipment and storage medium Download PDFInfo
- Publication number
- CN111105261B CN111105261B CN201811270903.6A CN201811270903A CN111105261B CN 111105261 B CN111105261 B CN 111105261B CN 201811270903 A CN201811270903 A CN 201811270903A CN 111105261 B CN111105261 B CN 111105261B
- Authority
- CN
- China
- Prior art keywords
- user
- address
- user behavior
- behavior data
- target user
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 66
- 230000006399 behavior Effects 0.000 claims description 132
- 238000013507 mapping Methods 0.000 claims description 18
- 238000011156 evaluation Methods 0.000 claims description 4
- 238000010586 diagram Methods 0.000 description 6
- 230000008901 benefit Effects 0.000 description 5
- 238000004590 computer program Methods 0.000 description 5
- 230000002159 abnormal effect Effects 0.000 description 4
- 230000009471 action Effects 0.000 description 4
- 230000008569 process Effects 0.000 description 4
- 238000012795 verification Methods 0.000 description 4
- 230000000694 effects Effects 0.000 description 3
- 238000009825 accumulation Methods 0.000 description 2
- 238000013459 approach Methods 0.000 description 2
- 238000013145 classification model Methods 0.000 description 2
- 238000013480 data collection Methods 0.000 description 2
- 238000003066 decision tree Methods 0.000 description 2
- 230000007774 longterm Effects 0.000 description 2
- 238000011084 recovery Methods 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000001960 triggered effect Effects 0.000 description 2
- 238000010276 construction Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 238000011835 investigation Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/02—Marketing; Price estimation or determination; Fundraising
- G06Q30/0241—Advertisements
- G06Q30/0242—Determining effectiveness of advertisements
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q30/00—Commerce
- G06Q30/02—Marketing; Price estimation or determination; Fundraising
- G06Q30/0241—Advertisements
- G06Q30/0277—Online advertisement
Landscapes
- Business, Economics & Management (AREA)
- Engineering & Computer Science (AREA)
- Accounting & Taxation (AREA)
- Development Economics (AREA)
- Strategic Management (AREA)
- Finance (AREA)
- Game Theory and Decision Science (AREA)
- Entrepreneurship & Innovation (AREA)
- Economics (AREA)
- Marketing (AREA)
- Physics & Mathematics (AREA)
- General Business, Economics & Management (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses a user identification method, a user identification device, electronic equipment and a storage medium. The method comprises the following steps: acquiring user behavior data of a target user; extracting an IP address corresponding to the user behavior from the user behavior data; and identifying whether the target user is a user of a specified type according to a preset IP address judging rule and the extracted IP address. The technical scheme carries out user identification from the dimension of the IP address, is suitable for identifying users generated by modes such as cheating and the like under the scenes such as user registration and the like, and is convenient for reducing the loss of popularization of products.
Description
Technical Field
The present invention relates to the field of computer technologies, and in particular, to a user identification method, a device, an electronic apparatus, and a storage medium.
Background
In various industries, products are not supported by users. In order to absorb more users (also called "get customers"), products often promote content through different channels, for example, depending on advertisements, in the internet age, users can very conveniently jump to the registration page of the product by clicking on the advertisements to register as users of the product.
Products often select multiple content channels for promotion and usually pay for the promotion according to the promotion effect. For example, the number of users registered through an advertising channel is one factor under investigation. Some advertising channel providers perform cheating in order to make a profit, so that some cheating users actually exist in registered users. There is a need for a method that can identify specific types of users such as cheating.
Disclosure of Invention
The present invention has been made in view of the above-mentioned problems, and provides a user identification method, apparatus, electronic device, and storage medium that overcome or at least partially solve the above-mentioned problems.
According to an aspect of the present invention, there is provided a user identification method, comprising:
acquiring user behavior data of a target user;
extracting an IP address corresponding to the user behavior from the user behavior data;
and identifying whether the target user is a user of a specified type according to a preset IP address judging rule and the extracted IP address.
Optionally, the acquiring the user behavior data of the target user includes:
And extracting and sorting user behavior data of the target user from the user behavior dotting log according to the user identification.
Optionally, the method further comprises:
Providing a front-end page comprising a plurality of buried points, and collecting the user behavior dotting logs according to the buried points;
The front page includes: registration pages and/or product pages.
Optionally, the user behavior data of the target user is user behavior data related to user registration.
Optionally, the identifying whether the target user is a user of a specified type according to the preset IP address determination rule and the extracted IP address includes:
Determining the geographic position corresponding to the IP address;
and judging whether the geographic position is matched with the delivery area of the product content, otherwise, marking the target user as a user of a specified type.
Optionally, the obtaining the geographic location corresponding to the IP address includes:
And determining the geographic position corresponding to the IP address according to the preset mapping relation between the IP address and the geographic position.
Optionally, the mapping relation between the IP address and the geographic location is stored in a preset IP address location database.
Optionally, the determining whether the geographic location matches with the delivery area of the product content, if so, marking the target user as the user of the specified type includes:
when the extracted IP addresses are multiple, if the geographic position corresponding to any IP address is not matched with the delivery area of the product content, marking the target user as a user of a specified type.
Optionally, the method further comprises:
counting the proportion of users of a designated type among the users registered according to each content channel in a preset time period;
and evaluating the quality of each content channel according to the proportion.
According to another aspect of the present invention, there is provided a user identification apparatus comprising:
the acquisition unit is suitable for acquiring user behavior data of a target user;
The extracting unit is suitable for extracting the IP address corresponding to the user behavior from the user behavior data;
And the identification unit is suitable for identifying whether the target user is a user of a specified type according to a preset IP address judgment rule and the extracted IP address.
Optionally, the obtaining unit is adapted to extract and sort user behavior data of the target user from the user behavior dotting log according to the user identifier.
Optionally, the acquiring unit is adapted to provide a front end page including a plurality of buried points, and collect the user behavior dotting log according to the buried points; the front page includes: registration pages and/or product pages.
Optionally, the user behavior data of the target user is user behavior data related to user registration.
Optionally, the identifying unit is adapted to determine a geographic location corresponding to the IP address; and judging whether the geographic position is matched with the delivery area of the product content, otherwise, marking the target user as a user of a specified type.
Optionally, the identifying unit is adapted to determine the geographic location corresponding to the IP address according to a preset mapping relationship between the IP address and the geographic location.
Optionally, the mapping relation between the IP address and the geographic location is stored in a preset IP address location database.
Optionally, the identifying unit is adapted to mark the target user as a user of a specified type if the geographic location corresponding to any one of the IP addresses does not match the delivery area of the product content when the extracted IP addresses are plural.
Optionally, the apparatus further comprises:
the content channel evaluation unit is suitable for counting the proportion of users of a designated type among the users registered according to each content channel in a preset time period; and evaluating the quality of each content channel according to the proportion.
According to still another aspect of the present invention, there is provided an electronic apparatus including: a processor; and a memory arranged to store computer executable instructions that, when executed, cause the processor to perform a method as described in any of the above.
According to a further aspect of the present invention there is provided a computer readable storage medium storing one or more programs which when executed by a processor implement a method as described in any of the above.
As can be seen from the above, according to the technical solution of the present invention, by acquiring the user behavior data of the target user, extracting the IP address corresponding to the user behavior from the user behavior data, and identifying whether the target user is a user of a specified type according to the preset IP address determination rule and the extracted IP address. The technical scheme carries out user identification from the dimension of the IP address, is suitable for identifying users generated by modes such as cheating and the like under the scenes such as user registration and the like, and is convenient for reducing the loss of popularization of products.
The foregoing description is only an overview of the present invention, and is intended to be implemented in accordance with the teachings of the present invention in order that the same may be more clearly understood and to make the same and other objects, features and advantages of the present invention more readily apparent.
Drawings
Various other advantages and benefits will become apparent to those of ordinary skill in the art upon reading the following detailed description of the preferred embodiments. The drawings are only for purposes of illustrating the preferred embodiments and are not to be construed as limiting the invention. Also, like reference numerals are used to designate like parts throughout the figures. In the drawings:
FIG. 1 shows a flow diagram of a user identification method according to one embodiment of the invention;
FIG. 2 is a schematic diagram of a user identification device according to one embodiment of the present invention;
FIG. 3 shows a schematic diagram of an electronic device according to one embodiment of the invention;
fig. 4 illustrates a schematic structure of a computer-readable storage medium according to an embodiment of the present invention.
Detailed Description
Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.
Fig. 1 shows a flow diagram of a user identification method according to an embodiment of the invention. As shown in fig. 1, the method includes:
step S110, user behavior data of a target user is acquired.
In the actual data collection process, the user behavior data may include a kind of a user behavior, a time when the user behavior occurs, device information when the user behavior occurs, network environment information corresponding to the user behavior, and the like. In the embodiment of the invention, only the IP address can be selected for use.
Step S120, extracting the IP address corresponding to the user behavior from the user behavior data.
When the user performs the actions such as registration and browsing through the page, if the user interacts with the server, the IP address corresponding to the user action can be collected.
Step S130, according to preset IP address judging rules and the extracted IP addresses, whether the target user is a user of a specified type is identified.
It can be seen that, in the method shown in fig. 1, by acquiring user behavior data of a target user, extracting an IP address corresponding to the user behavior from the user behavior data, and identifying whether the target user is a user of a specified type according to a preset IP address judgment rule and the extracted IP address. The technical scheme carries out user identification from the dimension of the IP address, is suitable for identifying users generated by modes such as cheating and the like under the scenes such as user registration and the like, and is convenient for reducing the loss of popularization of products.
In one embodiment of the present invention, in the above method, acquiring user behavior data of the target user includes: and extracting and sorting user behavior data of the target user from the user behavior dotting log according to the user identification.
The collection of the specified information through the pre-buried points is a convenient method for the internet related products, in this embodiment, the user behavior dotting logs can be collected in this way, specifically, in one embodiment of the present invention, the method further includes: providing a front-end page comprising a plurality of buried points, and collecting a user behavior dotting log according to the buried points; the front page includes: registration pages and/or product pages.
For example, when the mobile phone number is input to the registration page and the verification code is input to the registration page, the user is triggered to embed the point and report the user behavior, and records are performed in the user behavior dotting log.
Therefore, due to the fact that the number of buried points is large, records related to behaviors of various users and various types of users are collected in the user behavior dotting logs. In the above embodiment, one user identification may be determined based on device information or the like. Therefore, when the user behavior data of a certain appointed user is needed, the user behavior dotting logs can be processed and integrated according to the user identification, so that the user behavior data of the same user can be obtained. Taking a registration scenario as an example, complete behavior chain data of a registered user before and after registration can be obtained.
It should also be noted that in the embodiment of the present invention, not only the user behavior occurring on the registration page, but also the user behavior occurring on the product page may be focused. Because many content channels are cheated by registering users in batches, which users also do not log into the product after registration, and cannot create revenue for the product, many product operators choose to pay for promotion based on the registered active users. Thus, cheating is not limited to only registration, but also after registration. Therefore, the embodiment of the invention can also collect the user behavior data on the product page. As described in the background art, one of the popularization forms of the content is advertisement, and the content in the embodiment of the invention can be advertisement, and the corresponding content channel is an advertisement channel.
In one embodiment of the present invention, in the method, the user behavior data of the target user is user behavior data related to user registration.
As can be seen from the foregoing description, the user will generate user behavior on the product page during the daily use of the product, so in order to obtain user behavior data related to user registration, user behavior data of content in a certain period of time may be obtained according to the occurrence time of the user registration event, and used as user behavior data related to user registration, or user behavior data of a specific type of user behavior such as an input verification code, an input mobile phone number, and the like may be obtained.
In one embodiment of the present invention, in the above method, identifying whether the target user is a user of a specified type according to a preset IP address determination rule and the extracted IP address includes: determining the geographic position corresponding to the IP address; and judging whether the geographic position is matched with the delivery area of the product content, otherwise, marking the target user as a user of a specified type.
A specific example of identifying a target user using an IP address judgment rule and an extracted IP address is given in the present embodiment. Content delivery is often regional limited, e.g., content is delivered only domestically, and then a user registration from a foreign IP address is generated through the domestically delivered content, most likely due to content channel cheating. Such users may be marked as fake users or as cheating users and the corresponding traffic may be marked as fake registration traffic. The content delivery area may be an administrative area such as a country, province, city, etc.
In an embodiment of the present invention, in the above method, obtaining a geographic location corresponding to an IP address includes: and determining the geographic position corresponding to the IP address according to the preset mapping relation between the IP address and the geographic position.
In this embodiment, by presetting the mapping relationship between the IP address and the geographic location, the IP address is converted into the geographic location that actually corresponds, so that it can be easily determined whether the IP address matches with the delivery area of the product content. For example, content is delivered in china and the IP address is located in the united states, the corresponding traffic may be false registration traffic.
In an embodiment of the present invention, in the above method, a mapping relationship between an IP address and a geographic location is stored in a preset IP address location database. In the embodiment, the method can be realized by utilizing the existing high-precision IP address positioning database, so that the data is more accurate and the effect is better. In one embodiment of the present invention, in the above method, determining whether the geographic location matches with the delivery area of the product content, if so, marking the target user as the user of the specified type includes: when the extracted IP addresses are multiple, if the geographic position corresponding to any IP address is not matched with the delivery area of the product content, marking the target user as a user of a specified type.
In the prior art, the identification of false users or cheating users can be carried out in various ways by judging whether the cheating behavior exists or not through feature engineering and a decision tree classification model. The disadvantages of these approaches are generally: 1) The interpretability is poor, multidimensional data support is needed, and high requirements are set for data quality, but in an actual service scene, the high quality of the data can not be ensured, so that the model identification quality is affected; 2) The historical database is required to be used for inquiring, a large amount of historical data is required to be accumulated, or a three-party database is purchased, so that the operation cost is increased; 3) The abnormal data distribution in a relatively long time dimension needs to be observed, the recognition based on the granularity of the user is not performed, the short-term or real-time recognition cannot be performed, and the timeliness is poor.
As can be seen from the above embodiments, the technical solution of the present invention for identifying a false user or a cheating user has the following advantages: 1) The original user behavior data is utilized, the judgment logic is popular and easy to understand, the interpretability is high, the acceptance is easy, and initiative is strived for the business in settlement negotiation; 2) When the user behavior data related to the user registration event (such as the current day) is only used for judging, long-term data accumulation is not needed, the abnormal judgment time is T+1 days, the time efficiency is greatly improved, and the timely recovery and the reduction of the loss of a product operator are facilitated; 3) And a third party library is not required to be additionally used, so that the operation cost is saved.
In one embodiment of the present invention, the method further comprises: counting the proportion of users of a designated type among the users registered according to each content channel in a preset time period; and evaluating the quality of each content channel according to the proportion.
For example, the identified users are cheating users or false users, and when such users are high, it is apparent that the quality of the content channel is poor, and vice versa.
Fig. 2 is a schematic diagram showing a structure of a subscriber identity module according to an embodiment of the present invention. As shown in fig. 2, the user identification apparatus 200 includes:
the obtaining unit 210 is adapted to obtain user behavior data of the target user.
In the actual data collection process, the user behavior data may include a kind of a user behavior, a time when the user behavior occurs, device information when the user behavior occurs, network environment information corresponding to the user behavior, and the like. In the embodiment of the invention, only the IP address can be selected for use.
An extracting unit 220 adapted to extract an IP address corresponding to the user behavior from the user behavior data;
When the user performs the actions such as registration and browsing through the page, if the user interacts with the server, the IP address corresponding to the user action can be collected.
The identifying unit 230 is adapted to identify whether the target user is a user of a specified type according to a preset IP address judging rule and the extracted IP address.
As can be seen, in the apparatus shown in fig. 2, the user behavior data of the target user is obtained through the mutual coordination of the units, the IP address corresponding to the user behavior is extracted from the user behavior data, and whether the target user is a user of a specified type is identified according to the preset IP address judgment rule and the extracted IP address. The technical scheme carries out user identification from the dimension of the IP address, is suitable for identifying users generated by modes such as cheating and the like under the scenes such as user registration and the like, and is convenient for reducing the loss of popularization of products.
In an embodiment of the present invention, in the foregoing apparatus, the obtaining unit 210 is adapted to extract and collate, from the user behavior dotting log, user behavior data of the target user according to the user identifier.
In this embodiment, the user behavior dotting logs may be collected in this way, and specifically, in one embodiment of the present invention, in the above-mentioned apparatus, the obtaining unit 210 is adapted to provide a front-end page including a plurality of buried points, and collect the user behavior dotting logs according to the buried points; the front page includes: registration pages and/or product pages.
For example, when the mobile phone number is input to the registration page and the verification code is input to the registration page, the user is triggered to embed the point and report the user behavior, and records are performed in the user behavior dotting log.
Therefore, due to the fact that the number of buried points is large, records related to behaviors of various users and various types of users are collected in the user behavior dotting logs. In the above embodiment, one user identification may be determined based on device information or the like. Therefore, when the user behavior data of a certain appointed user is needed, the user behavior dotting logs can be processed and integrated according to the user identification, so that the user behavior data of the same user can be obtained. Taking a registration scenario as an example, complete behavior chain data of a registered user before and after registration can be obtained.
It should also be noted that in the embodiment of the present invention, not only the user behavior occurring on the registration page, but also the user behavior occurring on the product page may be focused. Because many content channels are cheated by registering users in batches, which users also do not log into the product after registration, and cannot create revenue for the product, many product operators choose to pay for promotion based on the registered active users. Thus, cheating is not limited to only registration, but also after registration. Therefore, the embodiment of the invention can also collect the user behavior data on the product page. As described in the background art, one of the popularization forms of the content is advertisement, and the content in the embodiment of the invention can be advertisement, and the corresponding content channel is an advertisement channel.
In one embodiment of the present invention, in the apparatus, the user behavior data of the target user is user behavior data related to user registration.
As can be seen from the foregoing description, the user will generate user behavior on the product page during the daily use of the product, so in order to obtain user behavior data related to user registration, user behavior data of content in a certain period of time may be obtained according to the occurrence time of the user registration event, and used as user behavior data related to user registration, or user behavior data of a specific type of user behavior such as an input verification code, an input mobile phone number, and the like may be obtained.
In one embodiment of the present invention, in the foregoing apparatus, the identifying unit 230 is adapted to determine a geographic location corresponding to the IP address; and judging whether the geographic position is matched with the delivery area of the product content, otherwise, marking the target user as a user of a specified type.
A specific example of identifying a target user using an IP address judgment rule and an extracted IP address is given in the present embodiment. Content delivery is often regional limited, e.g., content is delivered only domestically, and then a user registration from a foreign IP address is generated through the domestically delivered content, most likely due to content channel cheating. Such users may be marked as fake users or as cheating users and the corresponding traffic may be marked as fake registration traffic. The content delivery area may be an administrative area such as a country, province, city, etc.
In an embodiment of the present invention, in the foregoing apparatus, the identifying unit 230 is adapted to determine the geographic location corresponding to the IP address according to a preset mapping relationship between the IP address and the geographic location.
In this embodiment, by presetting the mapping relationship between the IP address and the geographic location, the IP address is converted into the geographic location that actually corresponds, so that it can be easily determined whether the IP address matches with the delivery area of the product content. For example, content is delivered in china and the IP address is located in the united states, the corresponding traffic may be false registration traffic.
In an embodiment of the present invention, in the above device, a mapping relationship between an IP address and a geographic location is stored in a preset IP address location database. In the embodiment, the method can be realized by utilizing the existing high-precision IP address positioning database, so that the data is more accurate and the effect is better. In an embodiment of the present invention, in the above apparatus, the identifying unit 230 is adapted to mark the target user as the user of the specified type if the geographic location corresponding to any one of the IP addresses does not match the delivery area of the product content when the plurality of the extracted IP addresses exist.
In the prior art, the identification of false users or cheating users can be carried out in various ways by judging whether the cheating behavior exists or not through feature engineering and a decision tree classification model. The disadvantages of these approaches are generally: 1) The interpretability is poor, multidimensional data support is needed, and high requirements are set for data quality, but in an actual service scene, the high quality of the data can not be ensured, so that the model identification quality is affected; 2) The historical database is required to be used for inquiring, a large amount of historical data is required to be accumulated, or a three-party database is purchased, so that the operation cost is increased; 3) The abnormal data distribution in a relatively long time dimension needs to be observed, the recognition based on the granularity of the user is not performed, the short-term or real-time recognition cannot be performed, and the timeliness is poor.
As can be seen from the above embodiments, the technical solution of the present invention for identifying a false user or a cheating user has the following advantages: 1) The original user behavior data is utilized, the judgment logic is popular and easy to understand, the interpretability is high, the acceptance is easy, and initiative is strived for the business in settlement negotiation; 2) When the user behavior data related to the user registration event (such as the current day) is only used for judging, long-term data accumulation is not needed, the abnormal judgment time is T+1 days, the time efficiency is greatly improved, and the timely recovery and the reduction of the loss of a product operator are facilitated; 3) And a third party library is not required to be additionally used, so that the operation cost is saved.
In one embodiment of the present invention, the apparatus further comprises: a content channel evaluation unit (not shown in fig. 2) adapted to count a proportion of users of a specified type among users registered according to each content channel within a preset period of time; and evaluating the quality of each content channel according to the proportion.
For example, the identified users are cheating users or false users, and when such users are high, it is apparent that the quality of the content channel is poor, and vice versa.
In summary, according to the technical scheme of the invention, the user behavior data of the target user is obtained, the IP address corresponding to the user behavior is extracted from the user behavior data, and whether the target user is a user of a specified type is identified according to the preset IP address judgment rule and the extracted IP address. The technical scheme carries out user identification from the dimension of the IP address, is suitable for identifying users generated by modes such as cheating and the like under the scenes such as user registration and the like, and is convenient for reducing the loss of popularization of products.
It should be noted that:
the algorithms and displays presented herein are not inherently related to any particular computer, virtual machine, or other apparatus. Various general purpose devices may also be used with the teachings herein. The required structure for the construction of such devices is apparent from the description above. In addition, the present invention is not directed to any particular programming language. It will be appreciated that the teachings of the present invention described herein may be implemented in a variety of programming languages, and the above description of specific languages is provided for disclosure of enablement and best mode of the present invention.
In the description provided herein, numerous specific details are set forth. However, it is understood that embodiments of the invention may be practiced without these specific details. In some instances, well-known methods, structures and techniques have not been shown in detail in order not to obscure an understanding of this description.
Similarly, it should be appreciated that in the foregoing description of exemplary embodiments of the invention, various features of the invention are sometimes grouped together in a single embodiment, figure, or description thereof for the purpose of streamlining the disclosure and aiding in the understanding of one or more of the various inventive aspects. However, the disclosed method should not be construed as reflecting the intention that: i.e., the claimed invention requires more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive aspects lie in less than all features of a single foregoing disclosed embodiment. Thus, the claims following the detailed description are hereby expressly incorporated into this detailed description, with each claim standing on its own as a separate embodiment of this invention.
Those skilled in the art will appreciate that the modules in the apparatus of the embodiments may be adaptively changed and disposed in one or more apparatuses different from the embodiments. The modules or units or components of the embodiments may be combined into one module or unit or component and, furthermore, they may be divided into a plurality of sub-modules or sub-units or sub-components. Any combination of all features disclosed in this specification (including any accompanying claims, abstract and drawings), and all of the processes or units of any method or apparatus so disclosed, may be used in combination, except insofar as at least some of such features and/or processes or units are mutually exclusive. Each feature disclosed in this specification (including any accompanying claims, abstract and drawings), may be replaced by alternative features serving the same, equivalent or similar purpose, unless expressly stated otherwise.
Furthermore, those skilled in the art will appreciate that while some embodiments described herein include some features but not others included in other embodiments, combinations of features of different embodiments are meant to be within the scope of the invention and form different embodiments. For example, in the following claims, any of the claimed embodiments can be used in any combination.
Various component embodiments of the invention may be implemented in hardware, or in software modules running on one or more processors, or in a combination thereof. Those skilled in the art will appreciate that some or all of the functions of some or all of the components in a subscriber identity device according to embodiments of the invention may be implemented in practice using a microprocessor or Digital Signal Processor (DSP). The present invention can also be implemented as an apparatus or device program (e.g., a computer program and a computer program product) for performing a portion or all of the methods described herein. Such a program embodying the present invention may be stored on a computer readable medium, or may have the form of one or more signals. Such signals may be downloaded from an internet website, provided on a carrier signal, or provided in any other form.
For example, fig. 3 shows a schematic structural diagram of an electronic device according to an embodiment of the present invention. The electronic device comprises a processor 310 and a memory 320 arranged to store computer executable instructions (computer readable program code). The memory 320 may be an electronic memory such as a flash memory, an EEPROM (electrically erasable programmable read only memory), an EPROM, a hard disk, or a ROM. The memory 320 has a memory space 330 storing computer readable program code 331 for performing any of the method steps described above. For example, the memory space 330 for storing computer readable program code may include respective computer readable program code 331 for implementing the respective steps in the above method, respectively. The computer readable program code 331 can be read from or written to one or more computer program products. These computer program products comprise a program code carrier such as a hard disk, a Compact Disc (CD), a memory card or a floppy disk. Such a computer program product is typically a computer readable storage medium as described for example in fig. 4. Fig. 4 illustrates a schematic structure of a computer-readable storage medium according to an embodiment of the present invention. The computer readable storage medium 400 stores computer readable program code 331 for performing the steps of the method according to the invention, which may be read by the processor 310 of the electronic device 300, which computer readable program code 331, when executed by the electronic device 300, causes the electronic device 300 to perform the steps of the method described above, in particular the computer readable program code 331 stored by the computer readable storage medium may perform the method shown in any of the embodiments described above. The computer readable program code 331 may be compressed in a suitable form.
It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and that those skilled in the art will be able to design alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps not listed in a claim. The word "a" or "an" preceding an element does not exclude the presence of a plurality of such elements. The invention may be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In the unit claims enumerating several means, several of these means may be embodied by one and the same item of hardware. The use of the words first, second, third, etc. do not denote any order. These words may be interpreted as names.
The embodiment of the invention discloses A1, a user identification method, which comprises the following steps:
acquiring user behavior data of a target user;
extracting an IP address corresponding to the user behavior from the user behavior data;
and identifying whether the target user is a user of a specified type according to a preset IP address judging rule and the extracted IP address.
A2, the method of A1, wherein the obtaining the user behavior data of the target user comprises:
And extracting and sorting user behavior data of the target user from the user behavior dotting log according to the user identification.
A3. the method of A2, wherein the method further comprises:
Providing a front-end page comprising a plurality of buried points, and collecting the user behavior dotting logs according to the buried points;
The front page includes: registration pages and/or product pages.
A4, the method of A1, wherein the user behavior data of the target user is user behavior data related to user registration.
A5, the method of A1, wherein the identifying whether the target user is a user of a specified type according to the preset IP address judging rule and the extracted IP address comprises:
Determining the geographic position corresponding to the IP address;
and judging whether the geographic position is matched with the delivery area of the product content, otherwise, marking the target user as a user of a specified type.
A6, the method of A5, wherein the obtaining the geographic location corresponding to the IP address comprises:
And determining the geographic position corresponding to the IP address according to the preset mapping relation between the IP address and the geographic position.
A7, the method as in A6, wherein the mapping relation between the IP address and the geographic position is stored in a preset IP address positioning database.
A8, the method of A5, wherein the judging whether the geographic position is matched with the delivery area of the product content, if so, marking the target user as the user of the designated type comprises:
when the extracted IP addresses are multiple, if the geographic position corresponding to any IP address is not matched with the delivery area of the product content, marking the target user as a user of a specified type.
A9, the method of A1, wherein the method further comprises:
counting the proportion of users of a designated type among the users registered according to each content channel in a preset time period;
and evaluating the quality of each content channel according to the proportion.
The embodiment of the invention also discloses a B10 and a user identification device, which comprises:
the acquisition unit is suitable for acquiring user behavior data of a target user;
The extracting unit is suitable for extracting the IP address corresponding to the user behavior from the user behavior data;
And the identification unit is suitable for identifying whether the target user is a user of a specified type according to a preset IP address judgment rule and the extracted IP address.
B11, the device of B10, wherein,
The acquisition unit is suitable for extracting and sorting user behavior data of the target user from the user behavior dotting log according to the user identification.
B12, the device of B11, wherein,
The acquisition unit is suitable for providing a front-end page comprising a plurality of buried points, and collecting the user behavior dotting logs according to the buried points; the front page includes: registration pages and/or product pages.
B13, the device of B10, wherein the user behavior data of the target user is user behavior data related to user registration.
B14, the apparatus of B10, wherein,
The identification unit is suitable for determining the geographic position corresponding to the IP address; and judging whether the geographic position is matched with the delivery area of the product content, otherwise, marking the target user as a user of a specified type.
The device of B15, B14, wherein,
The identification unit is suitable for determining the geographic position corresponding to the IP address according to the preset mapping relation between the IP address and the geographic position.
B16, the device as described in B15, wherein the mapping relation between the IP address and the geographic position is stored in a preset IP address positioning database.
The apparatus of B17, B14, wherein,
And the identification unit is suitable for marking the target user as a user of a specified type if the geographic position corresponding to any one IP address is not matched with the delivery area of the product content when a plurality of extracted IP addresses exist.
B18, the apparatus of B10, wherein the apparatus further comprises:
the content channel evaluation unit is suitable for counting the proportion of users of a designated type among the users registered according to each content channel in a preset time period; and evaluating the quality of each content channel according to the proportion.
The embodiment of the invention also discloses C19 and electronic equipment, wherein the electronic equipment comprises: a processor; and a memory arranged to store computer executable instructions that, when executed, cause the processor to perform the method of any of A1-A9.
Embodiments of the invention also disclose D20, a computer readable storage medium storing one or more programs which, when executed by a processor, implement the method of any of A1-A9.
Claims (18)
1. A user identification method, comprising:
acquiring user behavior data of a target user;
extracting an IP address corresponding to the user behavior from the user behavior data;
Identifying whether the target user is a user of a specified type according to a preset IP address judging rule and the extracted IP address;
Wherein the identifying whether the target user is a user of a specified type according to the preset IP address judging rule and the extracted IP address includes:
Determining the geographic position corresponding to the IP address;
and judging whether the geographic position is matched with the delivery area of the product content, otherwise, marking the target user as a user of a specified type.
2. The method of claim 1, wherein the obtaining user behavior data of the target user comprises:
And extracting and sorting user behavior data of the target user from the user behavior dotting log according to the user identification.
3. The method of claim 2, wherein the method further comprises:
Providing a front-end page comprising a plurality of buried points, and collecting the user behavior dotting logs according to the buried points;
The front page includes: registration pages and/or product pages.
4. The method of claim 1, wherein the user behavior data of the target user is user behavior data associated with user registration.
5. The method of claim 1, wherein the obtaining the geographic location corresponding to the IP address comprises:
And determining the geographic position corresponding to the IP address according to the preset mapping relation between the IP address and the geographic position.
6. The method of claim 5, wherein the mapping relationship between the IP address and the geographic location is stored in a preset IP address location database.
7. The method of claim 6, wherein the determining whether the geographic location matches a delivery area of product content, and if so, marking the target user as a specified type of user comprises:
when the extracted IP addresses are multiple, if the geographic position corresponding to any IP address is not matched with the delivery area of the product content, marking the target user as a user of a specified type.
8. The method of claim 1, wherein the method further comprises:
counting the proportion of users of a designated type among the users registered according to each content channel in a preset time period;
and evaluating the quality of each content channel according to the proportion.
9. A user identification device, comprising:
the acquisition unit is suitable for acquiring user behavior data of a target user;
The extracting unit is suitable for extracting the IP address corresponding to the user behavior from the user behavior data;
the identification unit is suitable for identifying whether the target user is a user of a specified type according to a preset IP address judgment rule and the extracted IP address;
the identification unit is suitable for determining the geographic position corresponding to the IP address; and judging whether the geographic position is matched with the delivery area of the product content, otherwise, marking the target user as a user of a specified type.
10. The apparatus of claim 9, wherein,
The acquisition unit is suitable for extracting and sorting user behavior data of the target user from the user behavior dotting log according to the user identification.
11. The apparatus of claim 10, wherein,
The acquisition unit is suitable for providing a front-end page comprising a plurality of buried points, and collecting the user behavior dotting logs according to the buried points; the front page includes: registration pages and/or product pages.
12. The apparatus of claim 9, wherein the user behavior data of the target user is user behavior data related to user registration.
13. The apparatus of claim 9, wherein,
The identification unit is suitable for determining the geographic position corresponding to the IP address according to the preset mapping relation between the IP address and the geographic position.
14. The apparatus of claim 13, wherein the mapping relationship between the IP address and the geographic location is stored in a preset IP address location database.
15. The apparatus of claim 9, wherein,
And the identification unit is suitable for marking the target user as a user of a specified type if the geographic position corresponding to any one IP address is not matched with the delivery area of the product content when a plurality of extracted IP addresses exist.
16. The apparatus of claim 9, wherein the apparatus further comprises:
the content channel evaluation unit is suitable for counting the proportion of users of a designated type among the users registered according to each content channel in a preset time period; and evaluating the quality of each content channel according to the proportion.
17. An electronic device, wherein the electronic device comprises: a processor; and a memory arranged to store computer executable instructions which, when executed, cause the processor to perform the method of any of claims 1-9.
18. A computer readable storage medium, wherein the computer readable storage medium stores one or more programs which, when executed by a processor, implement the method of any of claims 1-9.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811270903.6A CN111105261B (en) | 2018-10-29 | 2018-10-29 | User identification method, device, electronic equipment and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811270903.6A CN111105261B (en) | 2018-10-29 | 2018-10-29 | User identification method, device, electronic equipment and storage medium |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111105261A CN111105261A (en) | 2020-05-05 |
CN111105261B true CN111105261B (en) | 2024-04-19 |
Family
ID=70419927
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811270903.6A Active CN111105261B (en) | 2018-10-29 | 2018-10-29 | User identification method, device, electronic equipment and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111105261B (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111105259B (en) * | 2018-10-29 | 2024-04-02 | 北京奇虎科技有限公司 | User identification method, device, electronic equipment and storage medium |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102595249A (en) * | 2011-12-12 | 2012-07-18 | 青岛海信电器股份有限公司 | Method and device as well as system for positioning geographic position of television |
CN106161389A (en) * | 2015-04-16 | 2016-11-23 | 北京金山安全软件有限公司 | Cheating identification method and device and terminal |
CN107330718A (en) * | 2017-06-09 | 2017-11-07 | 晶赞广告(上海)有限公司 | A kind of anti-cheat method of media and device, storage medium, terminal |
WO2017215523A1 (en) * | 2016-06-12 | 2017-12-21 | 腾讯科技(深圳)有限公司 | Method and device for identifying type of geographic location at where user is located |
CN108009844A (en) * | 2017-11-20 | 2018-05-08 | 北京智钥科技有限公司 | Determine the method, apparatus and Cloud Server of advertisement cheating |
CN108109011A (en) * | 2017-12-28 | 2018-06-01 | 北京皮尔布莱尼软件有限公司 | A kind of anti-cheat method of advertisement and computing device |
-
2018
- 2018-10-29 CN CN201811270903.6A patent/CN111105261B/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102595249A (en) * | 2011-12-12 | 2012-07-18 | 青岛海信电器股份有限公司 | Method and device as well as system for positioning geographic position of television |
CN106161389A (en) * | 2015-04-16 | 2016-11-23 | 北京金山安全软件有限公司 | Cheating identification method and device and terminal |
WO2017215523A1 (en) * | 2016-06-12 | 2017-12-21 | 腾讯科技(深圳)有限公司 | Method and device for identifying type of geographic location at where user is located |
CN107330718A (en) * | 2017-06-09 | 2017-11-07 | 晶赞广告(上海)有限公司 | A kind of anti-cheat method of media and device, storage medium, terminal |
CN108009844A (en) * | 2017-11-20 | 2018-05-08 | 北京智钥科技有限公司 | Determine the method, apparatus and Cloud Server of advertisement cheating |
CN108109011A (en) * | 2017-12-28 | 2018-06-01 | 北京皮尔布莱尼软件有限公司 | A kind of anti-cheat method of advertisement and computing device |
Non-Patent Citations (2)
Title |
---|
Web user behavioral profiling for user identification;yinghui yang;Decision Sopport System;20100309;第49卷(第3期);261-271 * |
基于位置服务的广告投放系统的研究;段丽;郭丽丽;;黑龙江科技信息;20170615(第17期);171 * |
Also Published As
Publication number | Publication date |
---|---|
CN111105261A (en) | 2020-05-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106022834B (en) | Advertisement anti-cheating method and device | |
CN106384273B (en) | Malicious bill-swiping detection system and method | |
CN107248082B (en) | Card maintenance identification method and device | |
CN109034583A (en) | Abnormal transaction identification method, apparatus and electronic equipment | |
CN107800894B (en) | Intelligent voice prompt method and terminal equipment | |
CN104731816A (en) | Method and device for processing abnormal business data | |
CN106874335B (en) | Behavior data processing method and device and server | |
CN111523748B (en) | Case distribution method, device and system | |
CN105426759A (en) | URL legality determining method and apparatus | |
CN113379526A (en) | Intelligent invoice reimbursement method and device, electronic equipment and computer storage medium | |
CN110310020A (en) | Project alternative management method, relevant apparatus and storage medium based on data analysis | |
CN111104628A (en) | User identification method and device, electronic equipment and storage medium | |
CN111105261B (en) | User identification method, device, electronic equipment and storage medium | |
CN111105259B (en) | User identification method, device, electronic equipment and storage medium | |
CN111105263B (en) | User identification method, device, electronic equipment and storage medium | |
CN108197050B (en) | Equipment identification method, device and system | |
CN111127050A (en) | Content channel evaluation method and device, electronic equipment and storage medium | |
US7987123B1 (en) | Method and system for providing market analysis for wireless data markets | |
CN110348983B (en) | Transaction information management method and device, electronic equipment and non-transitory storage medium | |
CN112085332A (en) | Data processing method, data processing device, computer equipment and storage medium | |
CN111105262B (en) | User identification method, device, electronic equipment and storage medium | |
CN111105260B (en) | User identification method, device, electronic equipment and storage medium | |
CN108881591B (en) | Multi-platform information recommendation method and device and storage medium | |
CN108629610B (en) | Method and device for determining popularization information exposure | |
CN110992161A (en) | Accounting method and device and electronic equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |