[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

US5745654A - Fast explanations of scored observations - Google Patents

Fast explanations of scored observations Download PDF

Info

Publication number
US5745654A
US5745654A US08/600,858 US60085896A US5745654A US 5745654 A US5745654 A US 5745654A US 60085896 A US60085896 A US 60085896A US 5745654 A US5745654 A US 5745654A
Authority
US
United States
Prior art keywords
percentile
bin
observations
input variable
variable
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
US08/600,858
Inventor
Hari Titan
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fair Isaac Corp
Original Assignee
HNC Software Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by HNC Software Inc filed Critical HNC Software Inc
Priority to US08/600,858 priority Critical patent/US5745654A/en
Assigned to HNC SOFTWARE, INC. reassignment HNC SOFTWARE, INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: TITAN, HARI
Application granted granted Critical
Publication of US5745654A publication Critical patent/US5745654A/en
Assigned to FAIR ISAAC CORPORATION reassignment FAIR ISAAC CORPORATION MERGER (SEE DOCUMENT FOR DETAILS). Assignors: HNC SOFTWARE, INC.
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/10Interfaces, programming languages or software development kits, e.g. for simulating neural networks
    • G06N3/105Shells for specifying net layout

Definitions

  • This invention relates generally to the field of neural network scoring engines and specifically to the field of fast explanations for neural network scores, without scoring fabricated neural network inputs.
  • Neural networks are computational systems that are generally capable of modeling the non-linear aspects of complex data sets.
  • a neural net is capable of being trained with a set of observations, each observation comprising values for a set input variables, and a known value of an output variable. From this set of observations, the neural net develops a set of weightings that model a complex function describing the relationship between the input set and an output.
  • inputs for a new observation can be provided to the neural net, which will then produce an output, or a neural network score based on the learned model. This score may then be used for classification, pattern matching, and the like, with scores in particular ranges being associated with determined categories in the output space.
  • neural nets Conventionly do not inherently include mechanisms that provide an explanation of a particular output or score, or how it was generated for a particular input set. This is because the function that maps the input to the output score is opaque, held by the weightings in the neural net. Thus, conventional neural nets are limited in how well they can explain the reasons for a particular scoring, such as a classification of an input.
  • neural nets are used in credit card fraud detection to, detect whether a particular transaction is fraudulent.
  • the input here may be a set of variables describing the transaction, such as amount, location, item, time, last transaction and the like.
  • neural nets are used in loan analysis to determine whether a particular borrower is credit worthy; here the input is a set of variables describing the loan amount, salary, credit experience, and the like.
  • Some approaches attempt to provide explanations by determining which input variable(s), if alone missing from the input observation, would most alter the neural network score.
  • the method essentially creates additional observations for each input variable by replacing the value of that input variable with a population average value from a known population, such as the training data set. Each fabricated input is scored and compared with the score of the real observation. The variable when replaced with its population average value that scores most differently from the real observation score is identified as the most significant variable.
  • this type of method has several problems.
  • this method requires iterative re-scoring, using the created values for each input variable. For example, if there are 100 input variables, then there should be 100 re-scorings for a single observation.
  • observations In a high throughput environment, such as in online transaction processing for credit card transactions, where thousands, or tens of thousands of observations are processed in an hour, observations must be scored too rapidly to allow time for such re-scoring of each additional neural net inputs at explanation time. The time required to re-score the observation for the additional inputs is simply too great for the constraints of the application domain. This limits the use of explanations to only a selected number of observations.
  • the present invention overcomes the limitations of existing systems by providing a system, method, and product that enables the fast determination of an explanation of a neural net score for any particular input data set having any number of input variables.
  • the invention provides a software product in the form of an explanation tool.
  • the explanation tool takes a set of historical observations, such as those used to train a neural net, and for each input variable, places the output score from the neural net into one of a selected number of percentile bins, according to the value of the input variable.
  • a set of such percentile bins is produced for each input variable, each percentile bin defined by an upper and lower boundary value for the values of the input variable for the bin. The upper and lower boundary values of each bin are likely to be different for each input variable.
  • the number of percentile bins is selected by the user. For example, it may be useful to use three bins for each input variable, a "high” bin, a “medium” bin, and a “low” bin. A given input variable values will have a score in one of these three ranges, and the neural net score for the observation containing that input variable value is placed in the bin. There are at most N ⁇ M bins where N is the number of input variables in an observation, and M is a maximum number of desired bins. The number of bins for an input variable is less than the maximum M, and adjustable to ensure a minimum number of observations per bin. Where there are K observations, there are at least K/M observations in each percentile bin.
  • Each input variable, and each percentile bin may be associated with an explanation.
  • An explanation of an input variable describes the definition, purpose, or significance of the variable.
  • An explanation of a percentile bin describes the meaning of an input variable having a value in that percentile bin. For example, if an input variable is "Amount of Transaction" a percentile bin that holds the largest values for this input variable may have an explanation such as "The amount of the transaction is too high.”
  • an average, mean, or other expected neural net score is determined for the scores in the percentile bin, and this expected neural net score is assigned to the percentile bin. At this point, preprocessing is complete, and an explanation of any new observation may be quickly generated.
  • the new observation is scored by the neural network to produce a new neural network score.
  • the value of the input variable of the new observation is compared with the boundary values for the percentile bins associated with the input variable to identify the percentile bin in which the value of the input variable falls.
  • the expected neural network score associated with this percentile bin is retrieved, and compared with the new neural network score. This process is repeated for each input variable in the observation.
  • the input variable for which the expected neural network score of the percentile bin is closest to the actual neural network score is the most significant variable, and the explanation associated with that percentile bin, and if available, input variable is provided.
  • the explanations of percentile bins having the next closest expected neural network score may also be provided as secondary explanations.
  • the invention provides an extremely fast means of determining the explanation for a neural network score. This is because only simple lookup operations are required during runtime analysis, rather than the time consuming re-scoring of each input variable with artificial inputs as conventionally required. This allows a large number of observations to be "explained", rather than only a selected number.
  • the invention applies to multiple output models on discrete or continuous inputs, with no assumption about the linearity of the underlying neural net model.
  • the invention is independent of the neural net architecture itself, and may be used with any neural net design. This allows the invention to be efficiently implemented with any type of neural network, including those already operating in existing systems. This allows operators of an existing system to improve the utility of their existing neural networks.
  • FIG. 1 is an illustration of a system in accordance with one embodiment of the present invention.
  • FIG. 2 is a schematic representation of a table of percentile bins and explanations.
  • FIG. 3 is an illustration of the data relationships in the process of creating the table of percentile bins.
  • FIG. 4 is an example of a table of percentile bins for a single input variable.
  • FIG. 5 is a dataflow diagram of the process of determination an explanation for a new observation.
  • FIG. 6 is an illustration of an exemplary user interface for the explanation tool of the present invention.
  • the system 100 receives some number of observations 210 as inputs, and provides, through a suitable user interface 200, a score 220 for an observation 210, and one or more explanations 230 that describe the reasons for the score 220.
  • the system 100 operates on a conventional computer 110 and operating system; suitable computers 110 include Sun Microsystem's workstations using the Solaris 2.4 operating system.
  • the system 100 includes a database 120 that stores the observations 210.
  • An observation 210 is formed from a set of values for some number of input variables.
  • the input variables may be continuous, Boolean, or discrete. Symbolic variables may be to mapped discrete values by the systems designer.
  • the database 120 stores the observations 210 in any useful manner, such as a relational database model.
  • the database 120 is designed for online transaction processing environments, such as credit card transaction analysis, and the similar application domains requiring high throughput. It these in applications, that ability of the present invention to provide rapid determinations of the explanations 230 for an observation's score 220 becomes most useful.
  • the computer 110 is coupled through a conventional high speed data network to receive observations 210 from remote computers.
  • the system 100 includes a neural network 180.
  • the neural network 180 may have an arbitrary number of layers, and, with any arbitrary linear or non-linear bounded activation functions.
  • Selected observations 210 in the database 120 (a training set) are processed by a learning algorithm 130 to determine the weights for the neural network 180.
  • the learning algorithm 130 is typically a back propagation algorithm.
  • a feedforward algorithm 190 applies the input variables of the observation 120 to the neural network 180 in a conventional manner.
  • the score 220 is a score value for an output variable.
  • the score 220 is typically used to model some target variable 240.
  • the score 220 may be used to determine a target Boolean variable, indicating the presence or absence of fraud in the transaction.
  • the target variable 240 may be subsequently input for an observation 210.
  • the system 100 includes a preprocessing module 140 and an explanation module 170.
  • the preprocessing module 140 receives some number of observations 210 from the database 120 in conjunction with the scores 220 from the feedforward algorithm 180 for these observations 120.
  • the preprocessing module 140 creates a table of percentile bins 150 from the values of the input variables of the received observations 120, and the corresponding scores 220. Dataflow for preprocessing is shown as dashed lines.
  • each percentile bin 150 is associated with an expected score 250 for the observation 120.
  • the expected score 250 for a percentile bin is a representative score based on actual scores 220 of observations 120 for which the value of the input variable is within a range of percentiles associated with the percentile bin.
  • the explanation module 170 compares the score 220 for the observation with the expected score 250 in the percentile bin 150 for the input variable that contains the value of the input variable.
  • the explanations 230 associated with the percentile bin(s) 150 having the expected score 250 that most closely matches the actual score 220 are selected, and provided to the user interface 200 for display to a user.
  • the table includes attributes for an input variable number 152, which a foreign key to a table 306 listing the input variables in the database, each having an input variable number 152 and name 308.
  • the input variable number 152 in either table may be used as a foreign key to the explanation table 300 for providing explanations 230 of variables, such as definitions, significance, use, and the like.
  • Each input variable in the table has a determined number of percentile bins 150, and each percentile bin 150 has a bin number 154.
  • the bin number 154 usefully identifies the ranking of the bin, the lowest bin number 154 being the highest ranked bin.
  • the number of percentile bins is established by maximum number M of percentile bins 150. M is determined by the systems designer and should be selected such that M ⁇ K, where K is the number of observations. The number of bins may be less than M through adjustable binning, further described below. In one embodiment, M is set at 20, and K is preferably >1,000. M, the maximum number of percentile bins, establishes the minimum number of observations 210 used to establish the expected score 250 of each bin, as:
  • the maximum value (exclusive) of a bin is the lowest value (inclusive) of the next higher ranked bin.
  • the highest ranked bin has no maximum value and the lowest ranked bin has no lowest value.
  • the number of observations field 158 stores how many observations 210 were used to determine the expected score 250 for the percentile bin. This quantity should always be greater than the minimum number of observations.
  • each percentile bin has an explanation 230 associated with it.
  • the explanation 230 is associated by an explanation code 302 which is a foreign key to an explanation table 300.
  • other forms of reference may be used.
  • the explanations 230 are provided by the system designer and are dependent on the application domain. In some application domains it may not be necessary for each percentile bin 150 to have an explanation 230. For example, in a credit card fraud detection system, scores 220 in excess of some threshold score are classified as fraudulent transactions, and transactions with scores 220 less than the threshold are authorized. In this application, it is useful only to know the explanation of fraudulent transactions. Thus, only those percentile bins 150 that have expected scores 250 in excess of the threshold have explanation 230 associated with them. Again, this decision is implementation dependent, and not a limitation of the invention.
  • the table of percentile bins 150 and expected scores 250 are constructed by the preprocessing module 140 as follows.
  • a number K of observations 120 are selected 310 as the base, or training set for the percentile tables 150.
  • the number K of observations should be suitably large for the application domain.
  • These K observations 120 are scored 312 by the neural network 180 and have associated scores 250.
  • each observation 120 There are N input variables for each observation 120.
  • the selected observations 120 are ranked (sorted) 314 in descending order by the value of the input variable Var j . This will produce N ranked sets of observations 120.
  • the preprocessing module 140 determines the range of observations 120 in the ranked set that defines each percentile bin 150 for that input variable. Each percentile bin 150 will have at least K/M observations 120.
  • the i th percentile bin B j for this input variable Var j is B j ,i, and has as its minimum value 156 the observation O x :
  • the smallest value of the input variable in each of its (100/M)% ranked observations define a percentile bin 150 of that input variable, except for the lowest ranked bin, which is defined by the minimum value of the second lowest ranked bin.
  • the expected score 250 for each percentile bin 150 is then determined 316 by the preprocessing module 140 from the scores 220 in the percentile bin 150 as:
  • E(B j ,i) is the expected score 250 for percentile bin B j ,i.
  • the lowest ranked bin is defined by the preprocessing module 140 by the minimum value of the second lowest ranked bin 150.
  • Each percentile bin B j ,i has at least (K*i)/M scores 220 associated with it, excepts for the lowest ranked bin B j ,M, which may have less.
  • the expected score 250 is determined by the preprocessing module 140 as the sum of scores 220 in the multiplied by (M/K). Note that the preprocessing module 140 does not store values of the input variable Var j itself in the percentile bin 150, but merely uses them to define the boundaries of the percentile bin 150. It should also be noted that the preprocessing module 140 may create the ranked set of observations 210 for each input variable prior to (separate loops), or in conjunction with (nested loops) with determining the percentile bin expected scores 250.
  • the preferred embodiment of the preprocessing module 140 uses a mean calculation for the expected score 250.
  • the invention is not limited to the use of mean for the expected score 250, and other summary measures, such as mode, median, and the like may also be used.
  • the expected score 205 for a percentile bin 150 is the expected output for the neural network 180 given a value for the input variable that falls within the percentile bin 150.
  • the above rankings by the preprocessing module 140 can produce ties at the borders between percentile bins 150. This may happen, for example, for discrete input variables. It is preferable to keep all observations 120 of a particular value of an input variable in one percentile bin 150 or the other. To do so, the scores 220 of observations 150 for which value of the input variable is at minimum value 156, are moved into the next higher ranked percentile bin.
  • the preprocessing module 140 merges the lowest ranked percentile bin 150 with the second lowest ranked percentile bin 150.
  • the the preprocessing module 140 determines the expected score 250 in the resulting percentile bin 150 as above, based on the number of observations 158 in the merged percentile bin 150.
  • the merging of bins occurs with the lowest, and second lowest ranked bins because the highest ranked bin through the second lowest ranked bin have at least the minimum number of observations, since the bins are created from the "top" of the ranked set of observations. Thus, only the lowest ranked bin will have less than the minimum number of observations 210 prior to adjustment.
  • This adjustable binning by the preprocessing module 140 is particularly useful for discrete input variables.
  • each input variable may have any number percentile bins 150 such that the minimum number of observations are used to determine the expected score 250 for each bin.
  • the preprocessing module 140 may determine an expected target 160 for for each percentile bin 150 in the manner described above for the expected score 250, using the target value 240 in place of the score 220.
  • the expected score 250 for a percentile bin is tested by the preprocessing module 140 for how closely it approximates the expected target 240.
  • One implementation of this determination by the preprocessing module 140 uses a simple a confidence threshold e, and determines whether:
  • T(B j ,i) is the expected target 160 for percentile bin B j ,i. If this value exceeds e for any percentile bin 150, then either the number of bins for the input variable is decreased, or K is increased, and the percentile bins for that input variable are redetermined.
  • the primary association created by the preprocessing module 140 in the table of percentile bins 150 is the relationship between the expected score 250 and the explanation 230 for each percentile bin.
  • the provision of a bin number 154 and variable number 152 is useful for indexing and retrieving the expected scores 250 and explanation codes 302.
  • the number of observations 158 is useful for defining the percentile bins 150, and may be used to support the statistical accuracy of the expected score in a bin at some future time, though this is not necessary for the purposes of the invention.
  • the minimum value 156 is used by the explanation module 170 to identify the appropriate percentile bin from which to retrieve the expected score 250.
  • FIG. 4 illustrates a sample table of percentile bins 150 for a single input variable, "SIC -- RISK" in the credit card fraud application, described with respect to FIG. 6, in association with explanations 230 for the percentile bins 150.
  • Variable number “1” refers to the "SIC -- RISK” variable, as established in a separate table.
  • the bin numbers 154 indicate that there are 5 percentile bins 150 for this input variable.
  • Each percentile bin 150 has a number 156 of observations from the total set of observations 210 use to generate the table 150. An expected score 250 is held for each bin.
  • each of the percentile bins 150 includes an explanation code 302 to establish the association to a particular explanation 230 in the explanation table 300. Only a portion of the explanation table 300 Percentile bin 1 has an explanation 230 that observations 210 for which the value of the input variable SIC -- RISK was greater than 5.12 are credit card transactions taking place in a high risk type store. Note that bins no. 1 and 2 share the same explanation code 302 and hence explanation 230, even though they define separate bins 150.
  • a new observation 210 is received into the database 120.
  • the observation 210 may be entered as direct data input manually, or through automated data capture, for example, from a point of sale system, or other data acquisition device.
  • the observation 210 has values for some number of the input variables (values for all input variables is not necessary, as with conventional neural nets).
  • the new observation 210 is provided to both the feedforward algorithm 190 and the explanation module 170.
  • the feedforward algorithm 190 propagates the observation 210 through the neural network 180 and produces a score 220, and a target 240 which may be used to classify the observation 220 with a target variable 240.
  • the score 220, and if appropriate, the target 240 is provided to both the explanation module 170 and the user interface 200.
  • the explanation module 170 traverses the table of percentile bins 150 for each of the input variables Var j for the observation 210. For the input variable Var j , the explanation module 170 traverses the percentile bins B j ,i, where i indexes the percentile bins 150 for input variable Var j in the table 150. The explanation module 170 compares Var j with the minimum value 156 for the percentile bin B j ,i to identify the percentile bin 150 that contains the value of that input variable. The expected score 250 for that particular percentile bin 150 is retrieved, and the difference between the expected score 250 and the actual score 220 determined. This may be notated as:
  • S is the score 220 from the new observation 120
  • D j is the difference.
  • the difference result D j is temporarily stored in a table 400 along with the explanation code 302 from the percentile bin B j ,i.
  • the table 400 of difference results is not persistently stored, since it is useful only for the particular observation 210.
  • This difference result D j is determined for all of the input variables Var j .
  • a confidence threshold c is used to select only those expected scores 250 that are sufficiently close to the actual score 220 that input variable may be said to have significantly contributed to the actual score 220.
  • a confidence threshold c is used to select only those expected scores 250 that are sufficiently close to the actual score 220 that input variable may be said to have significantly contributed to the actual score 220.
  • the resulting table 400 of difference results is then sorted, low values to high.
  • the lowest difference result D 1 represents the input variable that most strongly contributes to the actual score 220 for the observation, and further, the value of that input variable, as falling within a particular percentile bin, has a particular explanation 230 associated with it.
  • the explanation 230 that is associated with this difference result D 1 is then retrieved from the explanation table 300.
  • the explanations 230 associated with the remaining difference results D j in the table may also be retrieved.
  • These difference results represent the input variable percentile bins 150 having the second through jth closest expected score 250.
  • These explanations 230 for the observation 210 are provided to the user interface 200.
  • the user interface 200 provides the target 240 and explanation(s) 230 to the user.
  • the score 220 may also be provided to the user, if useful in a specific application.
  • the explanation module 170 provides the explanation code 302 to the user interface 200, which then itself performs the lookup on the explanation table 300 for the explanation 230 corresponding to the explanation code.
  • the user interface 200 provides the explanation 230 to the user of the input variables and values thereof that most significantly determined the score 220 for the observation 120.
  • the user interface 200 and explanation module 170 are shown in FIG. 1 as part of the computer 110 that operates the other elements of the present invention.
  • the user interface 200 and explanation module 170 are provided in a separate client computer, with the computer 110 acting as a server computer, and providing access to the table of percentile bins 150.
  • the user interface 200 here is for an implementation of the system 100 in a credit card authorization and fraud detection system.
  • the user interface 200 here shows various information about an individual card holder, such as account number, name, date of birth, address, and the like, that may be obtained from an enterprise database coupled to the system 100 (not shown in FIG. 1).
  • an observation 210 is a credit card transaction subject to authorization and the determination of whether the transaction is fraudulent.
  • input variables such as the amount of purchase, the geographic location, prior purchase amounts, and the like for the observation 210.
  • the target variable is whether the transaction is fraudulent.
  • the user interface 200 illustrates the explanations 230 for one such determination.
  • the three most significant explanations 230 are provided. These explanations 230 are those associated with the percentile bins for the input variables of transaction dollar amount, previous transaction amounts, and geographic location of transaction.
  • the actual score 220 produced by the neural network 180 is 699 and appears under the column FRD -- SCOR.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computing Systems (AREA)
  • Software Systems (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Biophysics (AREA)
  • General Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Molecular Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Health & Medical Sciences (AREA)
  • Financial Or Insurance-Related Operations Such As Payment And Settlement (AREA)

Abstract

A system, method, and product provide rapid explanations for the scores determined by a neural network for new observations input into the neural network. The explanations are associated with a table of percentile bins for each of the input variables used to define the observation. The table contains for each input variable a number of percentile bins. Each percentile bin contains an expected score for values of the input variable containing with the percentile bin. The expected score in each percentile bin is determined from historical observation data. Preferably each percentile bin is associated with an explanation that describes the meaning of the value of the input variable falling within the percentile bin. During observation processing, a new observation is scored. The value of each input variable in the new observation is compared with the percentile bins for the input variable in the table. The expected score in the percentile bin that contains the value of the input variable is retrieved, and this is repeated for all input variables in the new observation. The explanation associated with the percentile bin that has an expected score closest to the actual score is retrieved and provided as the explanation of the most significant input variable accounting for score. Other explanations from the next closest expected scores may also be retrieved.

Description

BACKGROUND
1. Field of Invention
This invention relates generally to the field of neural network scoring engines and specifically to the field of fast explanations for neural network scores, without scoring fabricated neural network inputs.
2. Background of Invention
Neural networks are computational systems that are generally capable of modeling the non-linear aspects of complex data sets. In particular, a neural net is capable of being trained with a set of observations, each observation comprising values for a set input variables, and a known value of an output variable. From this set of observations, the neural net develops a set of weightings that model a complex function describing the relationship between the input set and an output. Once so trained, inputs for a new observation can be provided to the neural net, which will then produce an output, or a neural network score based on the learned model. This score may then be used for classification, pattern matching, and the like, with scores in particular ranges being associated with determined categories in the output space.
One known problem with neural nets is that they conventionally do not inherently include mechanisms that provide an explanation of a particular output or score, or how it was generated for a particular input set. This is because the function that maps the input to the output score is opaque, held by the weightings in the neural net. Thus, conventional neural nets are limited in how well they can explain the reasons for a particular scoring, such as a classification of an input.
For example, neural nets are used in credit card fraud detection to, detect whether a particular transaction is fraudulent. The input here may be a set of variables describing the transaction, such as amount, location, item, time, last transaction and the like. Similarly, neural nets are used in loan analysis to determine whether a particular borrower is credit worthy; here the input is a set of variables describing the loan amount, salary, credit experience, and the like.
For the credit card application, if a particular transaction is scored, or classified, as fraudulent, it is desirable to provide an explanation for the classification, in particular, the input variables that most significantly caused the resulting score. Likewise for the loan application it is desirable to provide an explanation of the input variables that lead to a denial of credit.
More generally then, it is desirable to determine first, which input variable alone most causes one to expect the neural network score generated for a particular observation, and second, which other input variable(s) most causes one to expect the neural network score generated for a particular observation? Such explanations for the score produced by a neural net are useful prior to taking specific actions based on the score, for example to ensure compliance with company procedure, legal requirements, and the like.
Various methods have been developed in an attempt to provide explanations for neural net scores. Some approaches attempt to provide explanations by determining which input variable(s), if alone missing from the input observation, would most alter the neural network score. The method essentially creates additional observations for each input variable by replacing the value of that input variable with a population average value from a known population, such as the training data set. Each fabricated input is scored and compared with the score of the real observation. The variable when replaced with its population average value that scores most differently from the real observation score is identified as the most significant variable.
This type of method has several problems. First, this method requires iterative re-scoring, using the created values for each input variable. For example, if there are 100 input variables, then there should be 100 re-scorings for a single observation. In a high throughput environment, such as in online transaction processing for credit card transactions, where thousands, or tens of thousands of observations are processed in an hour, observations must be scored too rapidly to allow time for such re-scoring of each additional neural net inputs at explanation time. The time required to re-score the observation for the additional inputs is simply too great for the constraints of the application domain. This limits the use of explanations to only a selected number of observations.
Second, by comparing with scores at the population averages, this method assumes the neural network model is effectively linear, which is not often true. Finally, if an observation is very different from those seen during modeling, basing reasons on scores of fabricated neural net inputs alone can produce highly unreliable score explanation.
Accordingly, it is desirable to provide a mechanism for providing explanations of neural network scores that is sufficiently fast that it is capable of use in high throughput systems where an explanation of many or most observations is desired. It is also desirable to provide an explanation mechanism that does not assume an underlying linearity to the neural net modeling function.
SUMMARY OF THE INVENTION
The present invention overcomes the limitations of existing systems by providing a system, method, and product that enables the fast determination of an explanation of a neural net score for any particular input data set having any number of input variables.
In one aspect, the invention provides a software product in the form of an explanation tool. The explanation tool takes a set of historical observations, such as those used to train a neural net, and for each input variable, places the output score from the neural net into one of a selected number of percentile bins, according to the value of the input variable. A set of such percentile bins is produced for each input variable, each percentile bin defined by an upper and lower boundary value for the values of the input variable for the bin. The upper and lower boundary values of each bin are likely to be different for each input variable.
The number of percentile bins is selected by the user. For example, it may be useful to use three bins for each input variable, a "high" bin, a "medium" bin, and a "low" bin. A given input variable values will have a score in one of these three ranges, and the neural net score for the observation containing that input variable value is placed in the bin. There are at most N×M bins where N is the number of input variables in an observation, and M is a maximum number of desired bins. The number of bins for an input variable is less than the maximum M, and adjustable to ensure a minimum number of observations per bin. Where there are K observations, there are at least K/M observations in each percentile bin.
Each input variable, and each percentile bin, may be associated with an explanation. An explanation of an input variable describes the definition, purpose, or significance of the variable. An explanation of a percentile bin describes the meaning of an input variable having a value in that percentile bin. For example, if an input variable is "Amount of Transaction" a percentile bin that holds the largest values for this input variable may have an explanation such as "The amount of the transaction is too high."
For each percentile bin, an average, mean, or other expected neural net score is determined for the scores in the percentile bin, and this expected neural net score is assigned to the percentile bin. At this point, preprocessing is complete, and an explanation of any new observation may be quickly generated.
To generate an explanation, first the new observation is scored by the neural network to produce a new neural network score. For each input variable in the observation, the value of the input variable of the new observation is compared with the boundary values for the percentile bins associated with the input variable to identify the percentile bin in which the value of the input variable falls. The expected neural network score associated with this percentile bin is retrieved, and compared with the new neural network score. This process is repeated for each input variable in the observation.
The input variable for which the expected neural network score of the percentile bin is closest to the actual neural network score is the most significant variable, and the explanation associated with that percentile bin, and if available, input variable is provided. The explanations of percentile bins having the next closest expected neural network score may also be provided as secondary explanations.
This approach provides numerous benefits over existing systems and methods. First, the invention provides an extremely fast means of determining the explanation for a neural network score. This is because only simple lookup operations are required during runtime analysis, rather than the time consuming re-scoring of each input variable with artificial inputs as conventionally required. This allows a large number of observations to be "explained", rather than only a selected number.
Second, the invention applies to multiple output models on discrete or continuous inputs, with no assumption about the linearity of the underlying neural net model.
Finally, the invention is independent of the neural net architecture itself, and may be used with any neural net design. This allows the invention to be efficiently implemented with any type of neural network, including those already operating in existing systems. This allows operators of an existing system to improve the utility of their existing neural networks.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 is an illustration of a system in accordance with one embodiment of the present invention.
FIG. 2 is a schematic representation of a table of percentile bins and explanations.
FIG. 3 is an illustration of the data relationships in the process of creating the table of percentile bins.
FIG. 4 is an example of a table of percentile bins for a single input variable.
FIG. 5 is a dataflow diagram of the process of determination an explanation for a new observation.
FIG. 6 is an illustration of an exemplary user interface for the explanation tool of the present invention.
DETAILED DESCRIPTION OF THE INVENTION
Referring now to FIG. 1, there is shown block diagram of one embodiment of a system in accordance with the present invention. The system 100 receives some number of observations 210 as inputs, and provides, through a suitable user interface 200, a score 220 for an observation 210, and one or more explanations 230 that describe the reasons for the score 220.
The system 100 operates on a conventional computer 110 and operating system; suitable computers 110 include Sun Microsystem's workstations using the Solaris 2.4 operating system.
The system 100 includes a database 120 that stores the observations 210. An observation 210 is formed from a set of values for some number of input variables. The input variables may be continuous, Boolean, or discrete. Symbolic variables may be to mapped discrete values by the systems designer. The database 120 stores the observations 210 in any useful manner, such as a relational database model. In the preferred embodiment, the database 120 is designed for online transaction processing environments, such as credit card transaction analysis, and the similar application domains requiring high throughput. It these in applications, that ability of the present invention to provide rapid determinations of the explanations 230 for an observation's score 220 becomes most useful. Preferably then, the computer 110 is coupled through a conventional high speed data network to receive observations 210 from remote computers.
The system 100 includes a neural network 180. The neural network 180 may have an arbitrary number of layers, and, with any arbitrary linear or non-linear bounded activation functions. Selected observations 210 in the database 120 (a training set) are processed by a learning algorithm 130 to determine the weights for the neural network 180. The learning algorithm 130 is typically a back propagation algorithm.
To produce a score 220 for an observation 120 with the neural network 180, a feedforward algorithm 190 applies the input variables of the observation 120 to the neural network 180 in a conventional manner. The score 220 is a score value for an output variable. The score 220 is typically used to model some target variable 240. For example, in a credit card fraud detection application, the score 220 may be used to determine a target Boolean variable, indicating the presence or absence of fraud in the transaction. The target variable 240 may be subsequently input for an observation 210.
In accordance with one embodiment of the present invention, the system 100 includes a preprocessing module 140 and an explanation module 170. The preprocessing module 140 receives some number of observations 210 from the database 120 in conjunction with the scores 220 from the feedforward algorithm 180 for these observations 120. The preprocessing module 140 creates a table of percentile bins 150 from the values of the input variables of the received observations 120, and the corresponding scores 220. Dataflow for preprocessing is shown as dashed lines.
More particularly, for each input variable a set of percentile bins is created. Each percentile bin 150 is associated with an expected score 250 for the observation 120. The expected score 250 for a percentile bin is a representative score based on actual scores 220 of observations 120 for which the value of the input variable is within a range of percentiles associated with the percentile bin.
At run time, reasons for the score 220 of a new observation 210 are provided by the explanation module 170. The explanation module 170 compares the score 220 for the observation with the expected score 250 in the percentile bin 150 for the input variable that contains the value of the input variable. The explanations 230 associated with the percentile bin(s) 150 having the expected score 250 that most closely matches the actual score 220 are selected, and provided to the user interface 200 for display to a user.
Referring now to FIG. 2, there is shown a schema of the table of percentile bins 150 created by the preprocessing module 140. The table includes attributes for an input variable number 152, which a foreign key to a table 306 listing the input variables in the database, each having an input variable number 152 and name 308. The input variable number 152 in either table may be used as a foreign key to the explanation table 300 for providing explanations 230 of variables, such as definitions, significance, use, and the like.
Each input variable in the table has a determined number of percentile bins 150, and each percentile bin 150 has a bin number 154. The bin number 154 usefully identifies the ranking of the bin, the lowest bin number 154 being the highest ranked bin. The number of percentile bins is established by maximum number M of percentile bins 150. M is determined by the systems designer and should be selected such that M<K, where K is the number of observations. The number of bins may be less than M through adjustable binning, further described below. In one embodiment, M is set at 20, and K is preferably >1,000. M, the maximum number of percentile bins, establishes the minimum number of observations 210 used to establish the expected score 250 of each bin, as:
Minimum number of observations per bin=K/M.
The maximum value (exclusive) of a bin is the lowest value (inclusive) of the next higher ranked bin. The highest ranked bin has no maximum value and the lowest ranked bin has no lowest value.
The number of observations field 158 stores how many observations 210 were used to determine the expected score 250 for the percentile bin. This quantity should always be greater than the minimum number of observations.
Ideally, each percentile bin has an explanation 230 associated with it. In a relational model as shown here, the explanation 230 is associated by an explanation code 302 which is a foreign key to an explanation table 300. In other embodiments, other forms of reference may be used. The explanations 230 are provided by the system designer and are dependent on the application domain. In some application domains it may not be necessary for each percentile bin 150 to have an explanation 230. For example, in a credit card fraud detection system, scores 220 in excess of some threshold score are classified as fraudulent transactions, and transactions with scores 220 less than the threshold are authorized. In this application, it is useful only to know the explanation of fraudulent transactions. Thus, only those percentile bins 150 that have expected scores 250 in excess of the threshold have explanation 230 associated with them. Again, this decision is implementation dependent, and not a limitation of the invention.
Referring now to FIG. 3, the table of percentile bins 150 and expected scores 250 are constructed by the preprocessing module 140 as follows.
First, a number K of observations 120 are selected 310 as the base, or training set for the percentile tables 150. The number K of observations should be suitably large for the application domain. These K observations 120 are scored 312 by the neural network 180 and have associated scores 250.
There are N input variables for each observation 120. For purposes of notation, value of each input variable is here referenced as Varj where j=1 to N. For each of the N input variables Varj, the selected observations 120 are ranked (sorted) 314 in descending order by the value of the input variable Varj. This will produce N ranked sets of observations 120.
For each input variable Varj the preprocessing module 140 determines the range of observations 120 in the ranked set that defines each percentile bin 150 for that input variable. Each percentile bin 150 will have at least K/M observations 120.
More particularly, for K observations Ox, x=1 to K ranked by the value of input variable Varj, the ith percentile bin Bj for this input variable Varj is Bj,i, and has as its minimum value 156 the observation Ox :
min(B.sub.j,i)=O.sub.(i*K)/M                               Eq. 1
The smallest value of the input variable in each of its (100/M)% ranked observations define a percentile bin 150 of that input variable, except for the lowest ranked bin, which is defined by the minimum value of the second lowest ranked bin.
The expected score 250 for each percentile bin 150 is then determined 316 by the preprocessing module 140 from the scores 220 in the percentile bin 150 as:
E(B.sub.j,i)=Mean(score|Var.sub.j >min(B.sub.j,i) Eq. 2
for i=1;
E(B.sub.n,i)=Mean(score|min(B.sub.j,i-1)>Var.sub.j >min(B.sub.j,i)Eq. 3
for 1<i≦(M-1);
where E(Bj,i) is the expected score 250 for percentile bin Bj,i. The lowest ranked bin is defined by the preprocessing module 140 by the minimum value of the second lowest ranked bin 150. Each percentile bin Bj,i has at least (K*i)/M scores 220 associated with it, excepts for the lowest ranked bin Bj,M, which may have less.
Where M is the number of percentile bins, the expected score 250 is determined by the preprocessing module 140 as the sum of scores 220 in the multiplied by (M/K). Note that the preprocessing module 140 does not store values of the input variable Varj itself in the percentile bin 150, but merely uses them to define the boundaries of the percentile bin 150. It should also be noted that the preprocessing module 140 may create the ranked set of observations 210 for each input variable prior to (separate loops), or in conjunction with (nested loops) with determining the percentile bin expected scores 250.
The preferred embodiment of the preprocessing module 140 uses a mean calculation for the expected score 250. However, the invention is not limited to the use of mean for the expected score 250, and other summary measures, such as mode, median, and the like may also be used.
In other words, the expected score 205 for a percentile bin 150 is the expected output for the neural network 180 given a value for the input variable that falls within the percentile bin 150.
The above rankings by the preprocessing module 140 can produce ties at the borders between percentile bins 150. This may happen, for example, for discrete input variables. It is preferable to keep all observations 120 of a particular value of an input variable in one percentile bin 150 or the other. To do so, the scores 220 of observations 150 for which value of the input variable is at minimum value 156, are moved into the next higher ranked percentile bin.
However, this adjustment results in un-even percentile bin sizes, that is an unequal number of observations 158 for each bin. If at least K/M observations 120 are needed in each percentile bin 150, and the number of observations 158 is less K/M, the preprocessing module 140 merges the lowest ranked percentile bin 150 with the second lowest ranked percentile bin 150. The the preprocessing module 140 determines the expected score 250 in the resulting percentile bin 150 as above, based on the number of observations 158 in the merged percentile bin 150. The merging of bins occurs with the lowest, and second lowest ranked bins because the highest ranked bin through the second lowest ranked bin have at least the minimum number of observations, since the bins are created from the "top" of the ranked set of observations. Thus, only the lowest ranked bin will have less than the minimum number of observations 210 prior to adjustment. This adjustable binning by the preprocessing module 140 is particularly useful for discrete input variables.
Accordingly, because M merely establishes a maximum number of percentile bins 150 for an input variable, each input variable may have any number percentile bins 150 such that the minimum number of observations are used to determine the expected score 250 for each bin.
In addition to determining an expected score 250, if target values 240 are available for the observations 120 (these may be subsequently obtained) then the preprocessing module 140 may determine an expected target 160 for for each percentile bin 150 in the manner described above for the expected score 250, using the target value 240 in place of the score 220. The expected score 250 for a percentile bin is tested by the preprocessing module 140 for how closely it approximates the expected target 240. One implementation of this determination by the preprocessing module 140 uses a simple a confidence threshold e, and determines whether:
|E(B.sub.j,i)-T(B.sub.ji)|≦e      Eq. 4
where T(Bj,i) is the expected target 160 for percentile bin Bj,i. If this value exceeds e for any percentile bin 150, then either the number of bins for the input variable is decreased, or K is increased, and the percentile bins for that input variable are redetermined.
In summary then, the primary association created by the preprocessing module 140 in the table of percentile bins 150 is the relationship between the expected score 250 and the explanation 230 for each percentile bin. The provision of a bin number 154 and variable number 152 is useful for indexing and retrieving the expected scores 250 and explanation codes 302. The number of observations 158 is useful for defining the percentile bins 150, and may be used to support the statistical accuracy of the expected score in a bin at some future time, though this is not necessary for the purposes of the invention. The minimum value 156 is used by the explanation module 170 to identify the appropriate percentile bin from which to retrieve the expected score 250.
FIG. 4 illustrates a sample table of percentile bins 150 for a single input variable, "SIC-- RISK" in the credit card fraud application, described with respect to FIG. 6, in association with explanations 230 for the percentile bins 150. Variable number "1" refers to the "SIC-- RISK" variable, as established in a separate table. The bin numbers 154 indicate that there are 5 percentile bins 150 for this input variable. Each percentile bin 150 has a number 156 of observations from the total set of observations 210 use to generate the table 150. An expected score 250 is held for each bin.
In FIG. 4, each of the percentile bins 150 includes an explanation code 302 to establish the association to a particular explanation 230 in the explanation table 300. Only a portion of the explanation table 300 Percentile bin 1 has an explanation 230 that observations 210 for which the value of the input variable SIC-- RISK was greater than 5.12 are credit card transactions taking place in a high risk type store. Note that bins no. 1 and 2 share the same explanation code 302 and hence explanation 230, even though they define separate bins 150.
Referring now FIG. 5, and generally to FIG. 1, the rapid determination of explanations 230 for a new observation by the explanation module 170 is follows. A new observation 210 is received into the database 120. The observation 210 may be entered as direct data input manually, or through automated data capture, for example, from a point of sale system, or other data acquisition device. The observation 210 has values for some number of the input variables (values for all input variables is not necessary, as with conventional neural nets).
The new observation 210 is provided to both the feedforward algorithm 190 and the explanation module 170. The feedforward algorithm 190 propagates the observation 210 through the neural network 180 and produces a score 220, and a target 240 which may be used to classify the observation 220 with a target variable 240. The score 220, and if appropriate, the target 240, is provided to both the explanation module 170 and the user interface 200.
The explanation module 170 traverses the table of percentile bins 150 for each of the input variables Varj for the observation 210. For the input variable Varj, the explanation module 170 traverses the percentile bins Bj,i, where i indexes the percentile bins 150 for input variable Varj in the table 150. The explanation module 170 compares Varj with the minimum value 156 for the percentile bin Bj,i to identify the percentile bin 150 that contains the value of that input variable. The expected score 250 for that particular percentile bin 150 is retrieved, and the difference between the expected score 250 and the actual score 220 determined. This may be notated as:
D.sub.j =|E(B.sub.j,i)-S|
where S is the score 220 from the new observation 120, and Dj is the difference. The difference result Dj is temporarily stored in a table 400 along with the explanation code 302 from the percentile bin Bj,i. The table 400 of difference results is not persistently stored, since it is useful only for the particular observation 210. This difference result Dj is determined for all of the input variables Varj.
Optionally, a confidence threshold c is used to select only those expected scores 250 that are sufficiently close to the actual score 220 that input variable may be said to have significantly contributed to the actual score 220. Thus, only those difference results Dj <c are stored in the table 400.
The resulting table 400 of difference results is then sorted, low values to high. The lowest difference result D1 represents the input variable that most strongly contributes to the actual score 220 for the observation, and further, the value of that input variable, as falling within a particular percentile bin, has a particular explanation 230 associated with it.
Accordingly, the explanation 230 that is associated with this difference result D1 is then retrieved from the explanation table 300. Optionally, the explanations 230 associated with the remaining difference results Dj in the table may also be retrieved. These difference results represent the input variable percentile bins 150 having the second through jth closest expected score 250.
These explanations 230 for the observation 210 are provided to the user interface 200. The user interface 200 provides the target 240 and explanation(s) 230 to the user. Optionally, the score 220 may also be provided to the user, if useful in a specific application. Alternatively, the explanation module 170 provides the explanation code 302 to the user interface 200, which then itself performs the lookup on the explanation table 300 for the explanation 230 corresponding to the explanation code. In either implementation, the user interface 200 provides the explanation 230 to the user of the input variables and values thereof that most significantly determined the score 220 for the observation 120.
This process of providing the explanations 230 for new observations 210 is extremely rapid since each observation 210 need only be scored once with the neural network 180, rather than N additional times, where N is the number of input variables, as with conventional systems. This makes a system operating in accordance with the present invention suitable for high throughput applications.
The user interface 200 and explanation module 170 are shown in FIG. 1 as part of the computer 110 that operates the other elements of the present invention. In preferred implementations, using networked computers, the user interface 200 and explanation module 170 are provided in a separate client computer, with the computer 110 acting as a server computer, and providing access to the table of percentile bins 150.
Referring now to FIG. 6, there is shown an example of a user interface 200 for providing explanations 230 for observations 210. The user interface 200 here is for an implementation of the system 100 in a credit card authorization and fraud detection system. The user interface 200 here shows various information about an individual card holder, such as account number, name, date of birth, address, and the like, that may be obtained from an enterprise database coupled to the system 100 (not shown in FIG. 1). In this application, an observation 210 is a credit card transaction subject to authorization and the determination of whether the transaction is fraudulent. Not shown, but applicable in this application, are input variables, such as the amount of purchase, the geographic location, prior purchase amounts, and the like for the observation 210. In this application domain, the target variable is whether the transaction is fraudulent. If this is the case, it is desirable to know the reason, or explanation of which input variable(s) was the primary reason for this determination. In FIG. 6, the user interface 200 illustrates the explanations 230 for one such determination. Here, the three most significant explanations 230 are provided. These explanations 230 are those associated with the percentile bins for the input variables of transaction dollar amount, previous transaction amounts, and geographic location of transaction. In this application, the actual score 220 produced by the neural network 180 is 699 and appears under the column FRD-- SCOR.

Claims (21)

I claim:
1. A computer system for providing explanations of scored values for observations, comprising:
a database storing a plurality of observations, each observation having a value for each of a plurality of same input variables, and a score value for an output variable produced by a neural network;
a neural network that receives observations and produces an output score for the observation based on the values of the input variables;
a computer readable memory including:
a table including for each of the input variables, a set of a variable number of percentile bins, each percentile bin of an input variable associated with an expected score value for the output variable, the expected score value determined by an average of score values of observations having values for the input variable within the percentile bin, at least one percentile bin of each input variable associated with an explanation of the percentile bin;
a computer program that receives a new observation and score value from the neural network and determines from the table the percentile bin with an expected score value that is closest to the score value of the new observation, and provides the explanation, if any associated, with the determined percentile bin to a user.
2. In a computer system including a database storing a plurality of observations, each observation having a value for selected ones of a plurality of input variables, and a score value for an output variable determined with respect to the values of the input variable, a computer implemented method of providing explanations for the score values for new observations, comprising:
storing a table including for each of the input variables, a variable number of percentile bins, each percentile bin of an input variable associated with an expected score value for the output variable, the expected score value determined as a function of score values of observations having values for the input variable within the percentile bin;
for each input variable, associating with each of selected ones of the percentile bins of the input variable, an explanation of the values for the input variable within the percentile bin;
receiving a new observation;
determining a score value for the new observation;
determining from the table the percentile bin with an expected score value that is closest to the score value of the new observation; and
providing to a user the explanation, if any, associated with the determined percentile bin.
3. The method of claim 2, wherein storing a table includes:
for each input variable:
ranking the score values of the observations according to the value of the input variable for the observation;
determining a number of percentile bins for the input variable, each percentile bin having a ranking; and
determining for each percentile bin an expected score value for the output variable for all observations having values of the input variable within the percentile bin.
4. The method of claim 3, further comprising:
determining whether the number of observations in a lowest ranked bin is less than a minimum number of observations; and
responsive to the number of observations being less than the minimum number of observations, merging the lowest ranked bin with a second lowest ranked bin, and redetermining the expected score value for the output variable for all observations having values of the input variable within the merged percentile bins.
5. The method of claim 2, further comprising:
receiving for each observation a value of target variable;
for each input variable:
determining for each percentile bin of the input variable an expected target value for the target variable for all observations having values of the input variable within the percentile bin;
determining whether the expected score value for the output variable substantially approximates the expected target value;
responsive to the expected score value not substantially approximating the expected target value, either:
increasing the number of observations and redetermining the expected scores for each percentile bin; or,
decreasing the number of percentile bins and redetermining the expected scores for each remaining percentile bin.
6. The method of claim 2 wherein determining from the table the percentile bin with an expected score value that is closest to the score value of the new observation comprises:
receiving the observation;
for each input variable in the observation:
obtaining an expected score from the percentile bin of the input variable within the table that contains the value of the input variable; and
selecting from the obtained expected scores the expected score that is closest to the score value of the new observation.
7. The method of claim 6, further comprising:
selecting from the obtained expected scores, the expected scores that are within a threshold value of the actual score; and
providing for each of the selected scores, an explanation associated with the percentile bin from which the expected score was obtained to explain each of the selected scores.
8. In a computer system including a database storing a plurality of observations, each observation having a value for selected ones of a plurality of input variables, and a score value for an output variable determined with respect to the values of the input variable, a computer implemented method of providing explanations for the score values for new observations, comprising:
creating a table including for each of the input variables, a set of a variable number of percentile bins, each percentile bin of an input variable associated with an expected score value for the output variable, the expected score value determined by averaging score values of observations having values for the input variable within the percentile bin, wherein the table is created by:
for each input variable:
ranking the score values of the observations according to the value of the input variable for the observation;
determining a number of percentile bins for the input variable, each percentile bin having a ranking;
determining for each percentile bin an expected score value for the output variable for all observations having values of the input variable within the percentile bin; and
determining whether the percentile bin is to be associated with an explanation, and if so, associating the percentile bin with an explanation;
receiving a new observation;
determining a score value for the new observation;
determining from the table the percentile bin with an expected score value that is closest to the score value of the new observation;
providing to a user the explanation, if any, associated with the determined percentile bin;
receiving for each observation a value of a target variable; and
for each input variable:
determining for each percentile bin of the input variable an expected target value for the target variable for all observations having values of the input variable within the percentile bin;
determining whether the expected score value for the output variable substantially approximates the expected target value;
responsive to the expected score value not substantially approximating the expected target value, either:
increasing the number of observations and redetermining the expected scores for each percentile bin; or,
decreasing the number of percentile bins and redetermining the expected scores for each remaining percentile bin.
9. In a computer system including a database storing a plurality of observations, each observation having a value for selected ones of a plurality of input variables, and a score value for an output variable determined with respect to the values of the input variable, a computer implemented method of providing explanations for the score values for new observations, comprising:
creating a table including for each of the input variables, a set of a variable number of percentile bins, each percentile bin of an input variable associated with an expected score value for the output variable, the expected score value determined by averaging score values of observations having values for the input variable within the percentile bin, wherein the table is created by:
for each input variable:
ranking the score values of the observations according to the value of the input variable for the observation;
determining a number of percentile bins for the input variable, each percentile bin having a ranking;
determining for each percentile bin an expected score value for the output variable for all observations having values of the input variable within the percentile bin; and
determining whether the percentile bin is to be associated with an explanation, and if so, associating the percentile bin with an explanation;
receiving a new observation;
determining a score value for the new observation;
determining from the table the percentile bin with an expected score value that is closest to the score value of the new observation by:
for each input variable in the new observation, obtaining an expected score from the percentile bin of the input variable within the table that contains the value of the input variable;
selecting from the obtained expected scores the expected score that is closest to the score value of the new observation;
selecting the percentile bin having the closest expected score; and
providing to a user the explanation, if any, associated with the selected percentile bin.
10. The method of claim 9, further comprising:
selecting from the obtained expected scores, the expected scores that are within a threshold value of the actual score; and
providing for each of the selected scores, an explanation associated with the percentile bin from which the expected score was obtained.
11. In a computer system including a database storing a plurality of observations, each observation having a value for selected ones of a plurality of input variables, and a score value for an output variable determined with respect to the values of the input variable, a computer readable memory storing a computer program therein for configuring a processor of the computer system to create a table of percentile bins associated with explanations for the input variables for providing explanations for the score values for new observations, the computer program configuring the processor to:
provide a score value for the output variable of a selected number of observations based on the values of the input variables of each observation; and
for each input variable:
rank the score values of the observations according to the value of the input variable for the observation;
determine a number of percentile bins for the input variable, each percentile bin having a ranking;
determine for each percentile bin an expected score value for the output variable for all observations having values of the input variable within the percentile bin; and
determine whether the percentile bin is to be associated with an explanation, and if so, associate the percentile bin with an explanation of the input variable for the percentile bin and the values of the input variable within the percentile bin.
12. The computer readable memory of claim 11, further configuring the processor to:
determine whether the number of observations in a lowest ranked bin is less than a minimum number of observations; and
responsive to the number of observations being less than the minimum number of observations, merge the lowest ranked bin with a second lowest ranked bin, and redetermine the expected score value for the output variable for all observations having values of the input variable within the merged percentile bins.
13. A computer readable memory, for use with a processor and a database storing a plurality of observations, each observation having a value for selected ones of a plurality of same input variables, and a score value for an output variable determined with respect to the values of the input variable; and a table including for each of the input variables, a set of a variable number of percentile bins, each percentile bin of an input variable associated with an expected score value for the output variable, the expected score value determined as a function of score values of observations having values for the input variable within the percentile bin, at least one percentile bin of each input variable in the table associated with an explanation of the input variables and the values of the input variable within the percentile bin, the computer readable memory storing a computer program for configuring and controlling the processor to provide explanations for the score values for new observations by performing the steps of:
receiving a new observation;
determining a score value for the new observation;
determining from the table the percentile bin with an expected score value that is closest to the score value of the new observation; and
providing to a user the explanation, if any, associated with the determined percentile bin.
14. A computer system for identifying at least one input variable significantly contributing to a score value for an individual observation, the system comprising:
a database storing a plurality of training observations, each training observation having a value for each of a plurality of input variables, and a score value for an output variable;
a scoring module that receives observations, and produces a score value for the output variable for the observation based on values of the input variables for the observation; and
a computer readable memory including:
a table including for each of the input variables, a variable number of percentile bins, each percentile bin of an input variable associated with an expected score value for the output variable, the expected score value determined as a function of score values of training observations having values for the input variable within the percentile bin; and
a computer program that receives a new, individual observation and score value from the scoring module and selects from the table the percentile bin with an expected score value that is closest to the score value of the new, individual observation, and identifies the input variable associated with the selected percentile bin.
15. In a computer system including a database storing a plurality of observations, each observation having a value for selected ones of a plurality of input variables, and a score value for an output variable determined with respect to the values of the input variable, a computer implemented method of identifying at least one input variable that significantly contributes to the score value for a new observation, comprising:
storing a table including for each of the input variables, a variable number of percentile bins, each percentile bin of an input variable associated with an expected score value for the output variable, the expected score value determined as a function of score values of observations having values for the input variable within the percentile bin;
receiving a new observation;
determining a score value for the new observation;
selecting from the table the percentile bin with an expected score value that is closest to the score value of the new observation; and
identifying the input variable associated with the selected percentile bin.
16. A computer implemented method of identifying an input variable that significantly contributes to a score value of an observation, comprising:
receiving a plurality of training observations, each training observation having a value for each input variable, and a score value of an output variable determined as a function of the values of the input variables;
for each input variable:
sorting the training observations according to the value of the input variable;
segregating the sorted training observations into subsets;
determining for each subset, an expected score value determined as a function of the score values of the training observations in the subset;
receiving a new observation having values for the input variables;
scoring the new observation to produce a new score value; and
comparing the new score value with the expected score values of the subsets of training observations for the input variables, to identify the input variable having a subset of training observations with an expected score value closest to the new score value of the new observation.
17. The computer implemented method of claim 16, further comprising:
for each input variable, associating with each subset of training scores, an explanation of the input variable and the values of the input variable in the subset; and
responsive to identifying the input variable having a subset of training observations with an expected score value closest to the new score value of the new observation, providing to a user the explanation associated with the subset of training observations.
18. In a computer system including a database storing a plurality of observations, each observation having a value for selected ones of a plurality of input variables, and a neural network score value for an output variable determined with respect to the values of the input variable, a computer implemented method of providing explanations for the neural network score values for new observations, comprising:
storing a table including for each of the input variables, a variable number of percentile bins, each percentile bin of an input variable associated with an expected score value for the output variable, the expected score value determined as a function of neural network score values of observations having values for the input variable within the percentile bin;
for each input variable, associating with each of selected ones of the percentile bins of the input variable, an explanation of the values for the input variable within the percentile bin;
receiving a new observation;
determining a neural network score value for the new observation;
determining from the table the percentile bin with an expected score value that is closest to the neural network score value of the new observation; and
providing to a user the explanation, if any, associated with the determined percentile bin.
19. A computer readable memory, for use with a processor and a database storing a plurality of observations, each observation having a value for selected ones of a plurality of same input variables, and a neural network score value for an output variable determined with respect to the values of the input variable, the computer readable memory storing a computer program for configuring and controlling the processor to create a table of percentile bins associated with explanations for the input variables, for providing explanations for the neural network score values for new observations by performing the steps of:
providing a neural network score value for the output variable of a selected number of observations based on the values of the input variables of each observation;
for each input variable:
ranking the neural network score values of the observations according to the value of the input variable for the observation;
determining a number of percentile bins for the input variable, each percentile bin having a ranking;
determining for each percentile bin an expected neural network score value for the output variable for all observations having values of the input variable within the percentile bin; and
determining whether the percentile bin is to be associated with an explanation, and if so, associate the percentile bin with an explanation of the input variable for the percentile bin and the values of the input variable within the percentile bin.
20. A computer readable memory, for use with a processor and a database storing a plurality of observations, each observation having a value for selected ones of a plurality of same input variables, and a neural network score value for an output variable determined with respect to the values of the input variable; and a table including for each of the input variables, a set of a variable number of percentile bins, each percentile bin of an input variable associated with an expected neural network score value for the output variable, the expected neural network score value determined as a function of score values of observations having values for the input variable within the percentile bin, at least one percentile bin of each input variable in the table associated with an explanation of the input variables and the values of the input variable within the percentile bin, the computer readable memory storing a computer program for configuring and controlling the processor to provide explanations for the neural network score values for new observations by performing the steps of:
receiving a new observation;
determining a neural network score value for the new observation;
determining from the table the percentile bin with an expected neural network score value that is closest to the neural network score value of the new observation; and
providing to a user the explanation, if any, associated with the determined percentile bin.
21. In a computer system including a database storing a plurality of observations, each observation having a value for selected ones of a plurality of input variables, and a neural network score value for an output variable determined with respect to the values of the input variable, a computer implemented method of identifying at least one input variable that significantly contributes to the neural network score value for a new observation, comprising:
storing a table including for each of the input variables, a variable number of percentile bins, each percentile bin of an input variable associated with an expected score value for the output variable, the expected score value determined as a function of neural network score values of observations having values for the input variable within the percentile bin;
receiving a new observation;
determining a neural network score value for the new observation;
selecting from the table the percentile bin with an expected score value that is closest to the neural network score value of the new observation; and
identifying the input variable associated with the selected percentile bin.
US08/600,858 1996-02-13 1996-02-13 Fast explanations of scored observations Expired - Lifetime US5745654A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US08/600,858 US5745654A (en) 1996-02-13 1996-02-13 Fast explanations of scored observations

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US08/600,858 US5745654A (en) 1996-02-13 1996-02-13 Fast explanations of scored observations

Publications (1)

Publication Number Publication Date
US5745654A true US5745654A (en) 1998-04-28

Family

ID=24405340

Family Applications (1)

Application Number Title Priority Date Filing Date
US08/600,858 Expired - Lifetime US5745654A (en) 1996-02-13 1996-02-13 Fast explanations of scored observations

Country Status (1)

Country Link
US (1) US5745654A (en)

Cited By (122)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6014645A (en) * 1996-04-19 2000-01-11 Block Financial Corporation Real-time financial card application system
WO2001065440A1 (en) * 2000-02-28 2001-09-07 Cardclues.Com, L.L.C. System and method for facilitating selection of a credit card
US6324524B1 (en) 1998-11-03 2001-11-27 Nextcard, Inc. Method and apparatus for an account level offer of credit and real time balance transfer
US20010056475A1 (en) * 1995-12-20 2001-12-27 Anderson William P. System for on-line financial services using distributed objects
US6405181B2 (en) 1998-11-03 2002-06-11 Nextcard, Inc. Method and apparatus for real time on line credit approval
US6422462B1 (en) * 1998-03-30 2002-07-23 Morris E. Cohen Apparatus and methods for improved credit cards and credit card transactions
US20030065563A1 (en) * 1999-12-01 2003-04-03 Efunds Corporation Method and apparatus for atm-based cross-selling of products and services
US6564195B1 (en) 1999-07-22 2003-05-13 Cerebrus Solutions Limited Data classifier output interpretation
US6567791B2 (en) * 1998-11-03 2003-05-20 Nextcard, Inc. Method and apparatus for a verifiable on line rejection of an application for credit
US6714918B2 (en) 2000-03-24 2004-03-30 Access Business Group International Llc System and method for detecting fraudulent transactions
US6775641B2 (en) 2000-03-09 2004-08-10 Smartsignal Corporation Generalized lensing angular similarity operator
US20040158524A1 (en) * 1995-10-11 2004-08-12 Block Financial Corporation Financial information access system
US20040225520A1 (en) * 2003-05-07 2004-11-11 Intelligent Wave, Inc. Fraud score calculating program, method of calculating fraud score, and fraud score calculating system for credit cards
US20040249866A1 (en) * 2000-03-10 2004-12-09 Hewlett-Packard Development Company, L.P. OLAP-based customer behavior profiling method and system
US20060149674A1 (en) * 2004-12-30 2006-07-06 Mike Cook System and method for identity-based fraud detection for transactions using a plurality of historical identity records
US7096192B1 (en) 1997-07-28 2006-08-22 Cybersource Corporation Method and system for detecting fraud in a credit card transaction over a computer network
US20070043577A1 (en) * 2005-08-16 2007-02-22 Sheldon Kasower Apparatus and method of enabling a victim of identity theft to resolve and prevent fraud
US20070282824A1 (en) * 2006-05-31 2007-12-06 Ellingsworth Martin E Method and system for classifying documents
US7333953B1 (en) 2000-10-31 2008-02-19 Wells Fargo Bank, N.A. Method and apparatus for integrated payments processing and decisioning for internet transactions
US20080091530A1 (en) * 2006-04-28 2008-04-17 Rockne Egnatios Methods and systems for providing cross-selling with online banking environments
US7383233B1 (en) * 1999-12-03 2008-06-03 General Electric Company Method of designing an electronic transaction system
US20080140576A1 (en) * 1997-07-28 2008-06-12 Michael Lewis Method and apparatus for evaluating fraud risk in an electronic commerce transaction
US20080215291A1 (en) * 2000-03-09 2008-09-04 Wegerich Stephan W Complex signal decomposition and modeling
US20080294540A1 (en) * 2007-05-25 2008-11-27 Celka Christopher J System and method for automated detection of never-pay data sets
US7458508B1 (en) 2003-05-12 2008-12-02 Id Analytics, Inc. System and method for identity-based fraud detection
US7562814B1 (en) 2003-05-12 2009-07-21 Id Analytics, Inc. System and method for identity-based fraud detection through graph anomaly detection
US7686214B1 (en) 2003-05-12 2010-03-30 Id Analytics, Inc. System and method for identity-based fraud detection using a plurality of historical identity records
US20100174638A1 (en) * 2009-01-06 2010-07-08 ConsumerInfo.com Report existence monitoring
US7841004B1 (en) 2007-04-05 2010-11-23 Consumerinfo.Com, Inc. Child identity monitor
US7865427B2 (en) 2001-05-30 2011-01-04 Cybersource Corporation Method and apparatus for evaluating fraud risk in an electronic commerce transaction
US7970722B1 (en) 1999-11-08 2011-06-28 Aloft Media, Llc System, method and computer program product for a collaborative decision platform
US20110172504A1 (en) * 2010-01-14 2011-07-14 Venture Gain LLC Multivariate Residual-Based Health Index for Human Health Monitoring
US7991689B1 (en) 2008-07-23 2011-08-02 Experian Information Solutions, Inc. Systems and methods for detecting bust out fraud using credit data
US8010422B1 (en) 1998-11-03 2011-08-30 Nextcard, Llc On-line balance transfers
US8036978B1 (en) * 1999-12-31 2011-10-11 Pitney Bowes Inc. Method of upgrading third party functionality in an electronic fraud management system
US20110307475A1 (en) * 2010-06-15 2011-12-15 Sas Institute Inc. Techniques to find percentiles in a distributed computing environment
US20120023022A1 (en) * 2009-03-20 2012-01-26 Pat Carroll Card-present security system
US8145567B2 (en) 2000-10-31 2012-03-27 Wells Fargo Bank, N.A. Transaction ID system and process
US8190513B2 (en) 1996-06-05 2012-05-29 Fraud Control Systems.Com Corporation Method of billing a purchase made over a computer network
US8214262B1 (en) 2006-12-04 2012-07-03 Lower My Bills, Inc. System and method of enhancing leads
US8229844B2 (en) 1996-06-05 2012-07-24 Fraud Control Systems.Com Corporation Method of billing a purchase made over a computer network
US8275577B2 (en) 2006-09-19 2012-09-25 Smartsignal Corporation Kernel-based method for detecting boiler tube leaks
US20120271872A1 (en) * 2009-06-21 2012-10-25 Tom Chau Procedure for denoising dual-axis swallowing accelerometry signals
US8311774B2 (en) 2006-12-15 2012-11-13 Smartsignal Corporation Robust distance measures for on-line monitoring
US8386377B1 (en) 2003-05-12 2013-02-26 Id Analytics, Inc. System and method for credit scoring using an identity network connectivity
US8606626B1 (en) 2007-01-31 2013-12-10 Experian Information Solutions, Inc. Systems and methods for providing a direct marketing campaign planning environment
US8620853B2 (en) 2011-07-19 2013-12-31 Smartsignal Corporation Monitoring method using kernel regression modeling with pattern sequences
US8630942B2 (en) 1996-06-05 2014-01-14 Fraud Control Systems.Com Corporation Method of billing a purchase made over a computer network
US8660980B2 (en) 2011-07-19 2014-02-25 Smartsignal Corporation Monitoring system using kernel regression modeling with pattern sequences
US8738732B2 (en) 2005-09-14 2014-05-27 Liveperson, Inc. System and method for performing follow up based on user interactions
US8762313B2 (en) 2008-07-25 2014-06-24 Liveperson, Inc. Method and system for creating a predictive model for targeting web-page to a surfer
US8799200B2 (en) 2008-07-25 2014-08-05 Liveperson, Inc. Method and system for creating a predictive model for targeting webpage to a surfer
US8805844B2 (en) 2008-08-04 2014-08-12 Liveperson, Inc. Expert search
US8805941B2 (en) 2012-03-06 2014-08-12 Liveperson, Inc. Occasionally-connected computing interface
US8868448B2 (en) 2000-10-26 2014-10-21 Liveperson, Inc. Systems and methods to facilitate selling of products and services
US8918891B2 (en) 2012-06-12 2014-12-23 Id Analytics, Inc. Identity manipulation detection system and method
US8918465B2 (en) 2010-12-14 2014-12-23 Liveperson, Inc. Authentication of service requests initiated from a social networking site
US8943002B2 (en) 2012-02-10 2015-01-27 Liveperson, Inc. Analytics driven engagement
US20150170149A1 (en) * 2013-12-18 2015-06-18 Verizon Patent And Licensing Inc. Financial authorization of an online transaction based on a location and an identifier of a user device
US9110916B1 (en) 2006-11-28 2015-08-18 Lower My Bills, Inc. System and method of removing duplicate leads
US9152727B1 (en) 2010-08-23 2015-10-06 Experian Marketing Solutions, Inc. Systems and methods for processing consumer information for targeted marketing applications
US20150339769A1 (en) * 2014-05-22 2015-11-26 C1 Bank System and method for enforcing data integrity and loan approval automation by means of data aggregation and analysis
US9250625B2 (en) 2011-07-19 2016-02-02 Ge Intelligent Platforms, Inc. System of sequential kernel regression modeling for forecasting and prognostics
US9256224B2 (en) 2011-07-19 2016-02-09 GE Intelligent Platforms, Inc Method of sequential kernel regression modeling for forecasting and prognostics
US9350598B2 (en) 2010-12-14 2016-05-24 Liveperson, Inc. Authentication of service requests using a communications initiation feature
US9432468B2 (en) 2005-09-14 2016-08-30 Liveperson, Inc. System and method for design and dynamic generation of a web page
US9563916B1 (en) 2006-10-05 2017-02-07 Experian Information Solutions, Inc. System and method for generating a finance attribute from tradeline data
US9563336B2 (en) 2012-04-26 2017-02-07 Liveperson, Inc. Dynamic user interface customization
US9576030B1 (en) 2014-05-07 2017-02-21 Consumerinfo.Com, Inc. Keeping up with the joneses
US9595051B2 (en) 2009-05-11 2017-03-14 Experian Marketing Solutions, Inc. Systems and methods for providing anonymized user profile data
US9633322B1 (en) 2013-03-15 2017-04-25 Consumerinfo.Com, Inc. Adjustment of knowledge-based authentication
US9652802B1 (en) 2010-03-24 2017-05-16 Consumerinfo.Com, Inc. Indirect monitoring and reporting of a user's credit data
US9672196B2 (en) 2012-05-15 2017-06-06 Liveperson, Inc. Methods and systems for presenting specialized content using campaign metrics
US9767309B1 (en) 2015-11-23 2017-09-19 Experian Information Solutions, Inc. Access control system for implementing access restrictions of regulated database records while identifying and providing indicators of regulated database records matching validation criteria
US9767212B2 (en) 2010-04-07 2017-09-19 Liveperson, Inc. System and method for dynamically enabling customized web content and applications
US9819561B2 (en) 2000-10-26 2017-11-14 Liveperson, Inc. System and methods for facilitating object assignments
US9892417B2 (en) 2008-10-29 2018-02-13 Liveperson, Inc. System and method for applying tracing tools for network locations
US20180157661A1 (en) * 2016-12-02 2018-06-07 Fair Isaac Corporation Fast automatic explanation of scored observations
US10078868B1 (en) 2007-01-31 2018-09-18 Experian Information Solutions, Inc. System and method for providing an aggregation tool
US10102536B1 (en) 2013-11-15 2018-10-16 Experian Information Solutions, Inc. Micro-geographic aggregation system
US10242019B1 (en) 2014-12-19 2019-03-26 Experian Information Solutions, Inc. User behavior segmentation using latent topic detection
US10255598B1 (en) 2012-12-06 2019-04-09 Consumerinfo.Com, Inc. Credit card account data extraction
US10262362B1 (en) 2014-02-14 2019-04-16 Experian Information Solutions, Inc. Automatic generation of code for attributes
US10278065B2 (en) 2016-08-14 2019-04-30 Liveperson, Inc. Systems and methods for real-time remote control of mobile applications
US10339527B1 (en) 2014-10-31 2019-07-02 Experian Information Solutions, Inc. System and architecture for electronic fraud detection
US10373198B1 (en) 2008-06-13 2019-08-06 Lmb Mortgage Services, Inc. System and method of generating existing customer leads
US10417704B2 (en) 2010-11-02 2019-09-17 Experian Technology Ltd. Systems and methods of assisted strategy design
US10453093B1 (en) 2010-04-30 2019-10-22 Lmb Mortgage Services, Inc. System and method of optimizing matching of leads
US10521857B1 (en) 2003-05-12 2019-12-31 Symantec Corporation System and method for identity-based fraud detection
US10528545B1 (en) 2007-09-27 2020-01-07 Experian Information Solutions, Inc. Database system for triggering event notifications based on updates to database records
US10565643B2 (en) 2002-05-30 2020-02-18 Consumerinfo.Com, Inc. Systems and methods of presenting simulated credit score information
US10586279B1 (en) 2004-09-22 2020-03-10 Experian Information Solutions, Inc. Automated analysis of data to generate prospect notifications based on trigger events
US10586280B2 (en) 2018-01-30 2020-03-10 PointPredictive Inc. Risk-based machine learning classsifier
US10593004B2 (en) 2011-02-18 2020-03-17 Csidentity Corporation System and methods for identifying compromised personally identifiable information on the internet
US10592982B2 (en) 2013-03-14 2020-03-17 Csidentity Corporation System and method for identifying related credit inquiries
US10671749B2 (en) 2018-09-05 2020-06-02 Consumerinfo.Com, Inc. Authenticated access and aggregation database platform
US10678894B2 (en) 2016-08-24 2020-06-09 Experian Information Solutions, Inc. Disambiguation and authentication of device users
US10699028B1 (en) 2017-09-28 2020-06-30 Csidentity Corporation Identity security architecture systems and methods
US10757154B1 (en) 2015-11-24 2020-08-25 Experian Information Solutions, Inc. Real-time event-based notification system
US10810605B2 (en) 2004-06-30 2020-10-20 Experian Marketing Solutions, Llc System, method, software and data structure for independent prediction of attitudinal and message responsiveness, and preferences for communication media, channel, timing, frequency, and sequences of communications, using an integrated data repository
US10846295B1 (en) 2019-08-08 2020-11-24 Applied Underwriters, Inc. Semantic analysis system for ranking search results
US10869253B2 (en) 2015-06-02 2020-12-15 Liveperson, Inc. Dynamic communication routing based on consistency weighting and routing rules
US10896472B1 (en) 2017-11-14 2021-01-19 Csidentity Corporation Security and identity verification system and architecture
US11030562B1 (en) 2011-10-31 2021-06-08 Consumerinfo.Com, Inc. Pre-data breach monitoring
US11151468B1 (en) 2015-07-02 2021-10-19 Experian Information Solutions, Inc. Behavior analysis using distributed representations of event data
US11157997B2 (en) 2006-03-10 2021-10-26 Experian Information Solutions, Inc. Systems and methods for analyzing data
US11176475B1 (en) 2014-03-11 2021-11-16 Applied Underwriters, Inc. Artificial intelligence system for training a classifier
US11227001B2 (en) 2017-01-31 2022-01-18 Experian Information Solutions, Inc. Massive scale heterogeneous data ingestion and user resolution
US11257117B1 (en) 2014-06-25 2022-02-22 Experian Information Solutions, Inc. Mobile device sighting location analytics and profiling system
US11386442B2 (en) 2014-03-31 2022-07-12 Liveperson, Inc. Online behavioral predictor
US11410230B1 (en) 2015-11-17 2022-08-09 Consumerinfo.Com, Inc. Realtime access and control of secure regulated data
US11449850B2 (en) 2009-01-28 2022-09-20 Validsoft Limited Card false-positive prevention
US11682041B1 (en) 2020-01-13 2023-06-20 Experian Marketing Solutions, Llc Systems and methods of a tracking analytics platform
US11720527B2 (en) 2014-10-17 2023-08-08 Zestfinance, Inc. API for implementing scoring functions
US11720962B2 (en) 2020-11-24 2023-08-08 Zestfinance, Inc. Systems and methods for generating gradient-boosted models with improved fairness
US11809434B1 (en) 2014-03-11 2023-11-07 Applied Underwriters, Inc. Semantic analysis system for ranking search results
US11816541B2 (en) 2019-02-15 2023-11-14 Zestfinance, Inc. Systems and methods for decomposition of differentiable and non-differentiable models
US11847574B2 (en) 2018-05-04 2023-12-19 Zestfinance, Inc. Systems and methods for enriching modeling tools and infrastructure with semantics
US11861691B1 (en) 2011-04-29 2024-01-02 Consumerinfo.Com, Inc. Exposing reporting cycle information
US11893466B2 (en) 2019-03-18 2024-02-06 Zestfinance, Inc. Systems and methods for model fairness
US11941650B2 (en) 2017-08-02 2024-03-26 Zestfinance, Inc. Explainable machine learning financial credit approval model for protected classes of borrowers
US11960981B2 (en) 2018-03-09 2024-04-16 Zestfinance, Inc. Systems and methods for providing machine learning model evaluation by using decomposition

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5398300A (en) * 1990-07-27 1995-03-14 Hnc, Inc. Neural network having expert system functionality
US5559925A (en) * 1994-06-24 1996-09-24 Apple Computer, Inc. Determining the useability of input signals in a data recognition system

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5398300A (en) * 1990-07-27 1995-03-14 Hnc, Inc. Neural network having expert system functionality
US5559925A (en) * 1994-06-24 1996-09-24 Apple Computer, Inc. Determining the useability of input signals in a data recognition system

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
Klimasauskas, Casimir C. "Neural Nets Tell Why" Dr. Dobb's Journal, vol. 16, No. 4, pp. 16-24, Apr. 1991.
Klimasauskas, Casimir C. Neural Nets Tell Why Dr. Dobb s Journal, vol. 16, No. 4, pp. 16 24, Apr. 1991. *
Poh, Hean Lee and Jasic, Teo. "Forecasting and Analysis of Marketing Data Using Neural Networks: A Case of Advertising and Promotion Imapct" Artificial Intelligence for Applications 1995 Conference, pp. 224-230, 1995.
Poh, Hean Lee and Jasic, Teo. Forecasting and Analysis of Marketing Data Using Neural Networks: A Case of Advertising and Promotion Imapct Artificial Intelligence for Applications 1995 Conference, pp. 224 230, 1995. *

Cited By (286)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040158524A1 (en) * 1995-10-11 2004-08-12 Block Financial Corporation Financial information access system
US7523055B2 (en) 1995-10-11 2009-04-21 Block Financial Llc Financial information access system
US20010056475A1 (en) * 1995-12-20 2001-12-27 Anderson William P. System for on-line financial services using distributed objects
US7296065B2 (en) 1995-12-20 2007-11-13 Block Financial Corporation System for on-line financial services using distributed objects
US20060179117A1 (en) * 1995-12-20 2006-08-10 Block Financial Corporation System for on-line financial services using distributed objects
US7032008B2 (en) 1995-12-20 2006-04-18 Block Financial Corporation System for on-line financial services using distributed objects
US7310617B1 (en) 1996-04-19 2007-12-18 Block Financial Corporation Real-time financial card offer and application system
US6014645A (en) * 1996-04-19 2000-01-11 Block Financial Corporation Real-time financial card application system
US8190513B2 (en) 1996-06-05 2012-05-29 Fraud Control Systems.Com Corporation Method of billing a purchase made over a computer network
US8630942B2 (en) 1996-06-05 2014-01-14 Fraud Control Systems.Com Corporation Method of billing a purchase made over a computer network
US8229844B2 (en) 1996-06-05 2012-07-24 Fraud Control Systems.Com Corporation Method of billing a purchase made over a computer network
US7752084B2 (en) 1997-07-28 2010-07-06 Cybersource Corporation Method and system for detecting fraud in a credit card transaction over the internet
US7096192B1 (en) 1997-07-28 2006-08-22 Cybersource Corporation Method and system for detecting fraud in a credit card transaction over a computer network
US7970701B2 (en) 1997-07-28 2011-06-28 Cybersource Corporation Method and apparatus for evaluating fraud risk in an electronic commerce transaction
US20100063917A1 (en) * 1997-07-28 2010-03-11 John Philip Pettitt Method and system for detecting fraud in a credit card transaction over the internet
US8244629B2 (en) 1997-07-28 2012-08-14 Michael Lewis Method and apparatus for generating a bi-gram score in fraud risk analysis
US20080281743A1 (en) * 1997-07-28 2008-11-13 John Philip Pettit Method and system for detecting fraud in a credit card transaction over the internet
US7403922B1 (en) 1997-07-28 2008-07-22 Cybersource Corporation Method and apparatus for evaluating fraud risk in an electronic commerce transaction
US20080140576A1 (en) * 1997-07-28 2008-06-12 Michael Lewis Method and apparatus for evaluating fraud risk in an electronic commerce transaction
US6422462B1 (en) * 1998-03-30 2002-07-23 Morris E. Cohen Apparatus and methods for improved credit cards and credit card transactions
US7756781B2 (en) * 1998-11-03 2010-07-13 Nextcard, Llc Method and apparatus for a verifiable on line rejection of an applicant for credit
US20100262535A1 (en) * 1998-11-03 2010-10-14 Lent Jeremy R Method and apparatus for a verifiable on line rejection of an application for credit
US20070027785A1 (en) * 1998-11-03 2007-02-01 Nextcard, Inc. Method and apparatus for a verifiable on line rejection of an applicant for credit
US6567791B2 (en) * 1998-11-03 2003-05-20 Nextcard, Inc. Method and apparatus for a verifiable on line rejection of an application for credit
US20080270295A1 (en) * 1998-11-03 2008-10-30 Lent Jeremy R Method and Apparatus for Real Time Online Credit Approval
US8010422B1 (en) 1998-11-03 2011-08-30 Nextcard, Llc On-line balance transfers
US7505939B2 (en) * 1998-11-03 2009-03-17 Nextcard, Inc. Method and apparatus for a verifiable on line rejection of an applicant for credit
US20080270294A1 (en) * 1998-11-03 2008-10-30 Lent Jeremy R Method and Apparatus for a Verifiable On Line Rejection of an Applicant for Credit
US6324524B1 (en) 1998-11-03 2001-11-27 Nextcard, Inc. Method and apparatus for an account level offer of credit and real time balance transfer
US20040039687A1 (en) * 1998-11-03 2004-02-26 Nextcard, Inc. Method and apparatus for a verifiable on line rejection of an applicant for credit
US6405181B2 (en) 1998-11-03 2002-06-11 Nextcard, Inc. Method and apparatus for real time on line credit approval
US7143063B2 (en) 1998-11-03 2006-11-28 Nextcard, Inc. Method and apparatus for a verifiable on line rejection of an applicant for credit
US6564195B1 (en) 1999-07-22 2003-05-13 Cerebrus Solutions Limited Data classifier output interpretation
US8160988B1 (en) 1999-11-08 2012-04-17 Aloft Media, Llc System, method and computer program product for a collaborative decision platform
US8005777B1 (en) 1999-11-08 2011-08-23 Aloft Media, Llc System, method and computer program product for a collaborative decision platform
US7970722B1 (en) 1999-11-08 2011-06-28 Aloft Media, Llc System, method and computer program product for a collaborative decision platform
US20030065563A1 (en) * 1999-12-01 2003-04-03 Efunds Corporation Method and apparatus for atm-based cross-selling of products and services
US10776865B2 (en) 1999-12-01 2020-09-15 Efunds Corporation Method and apparatus for ATM-based cross-selling of products and services
US7383233B1 (en) * 1999-12-03 2008-06-03 General Electric Company Method of designing an electronic transaction system
US8036978B1 (en) * 1999-12-31 2011-10-11 Pitney Bowes Inc. Method of upgrading third party functionality in an electronic fraud management system
WO2001065440A1 (en) * 2000-02-28 2001-09-07 Cardclues.Com, L.L.C. System and method for facilitating selection of a credit card
US8239170B2 (en) 2000-03-09 2012-08-07 Smartsignal Corporation Complex signal decomposition and modeling
US20040260515A1 (en) * 2000-03-09 2004-12-23 Smartsignal Corporation Generalized lensing angular similarity operator
US20080215291A1 (en) * 2000-03-09 2008-09-04 Wegerich Stephan W Complex signal decomposition and modeling
US6775641B2 (en) 2000-03-09 2004-08-10 Smartsignal Corporation Generalized lensing angular similarity operator
US20040249866A1 (en) * 2000-03-10 2004-12-09 Hewlett-Packard Development Company, L.P. OLAP-based customer behavior profiling method and system
US8190554B2 (en) 2000-03-10 2012-05-29 Hewlett-Packard Development Company, L.P. OLAP-based customer behavior profiling method and system
US6917940B1 (en) * 2000-03-10 2005-07-12 Hewlett-Packard Development Company, L.P. Olap-based customer behavior profiling method and system
US6714918B2 (en) 2000-03-24 2004-03-30 Access Business Group International Llc System and method for detecting fraudulent transactions
US9819561B2 (en) 2000-10-26 2017-11-14 Liveperson, Inc. System and methods for facilitating object assignments
US10797976B2 (en) 2000-10-26 2020-10-06 Liveperson, Inc. System and methods for facilitating object assignments
US8868448B2 (en) 2000-10-26 2014-10-21 Liveperson, Inc. Systems and methods to facilitate selling of products and services
US9576292B2 (en) 2000-10-26 2017-02-21 Liveperson, Inc. Systems and methods to facilitate selling of products and services
US8204826B2 (en) 2000-10-31 2012-06-19 Wells Fargo Bank, N.A. Method and apparatus for integrated payments processing and decisioning for internet transactions
US8407145B1 (en) 2000-10-31 2013-03-26 Wells Fargo Bank, N.A. Transaction ID system and process
US8145567B2 (en) 2000-10-31 2012-03-27 Wells Fargo Bank, N.A. Transaction ID system and process
US7333953B1 (en) 2000-10-31 2008-02-19 Wells Fargo Bank, N.A. Method and apparatus for integrated payments processing and decisioning for internet transactions
US7865427B2 (en) 2001-05-30 2011-01-04 Cybersource Corporation Method and apparatus for evaluating fraud risk in an electronic commerce transaction
US8019678B2 (en) 2001-05-30 2011-09-13 Cybersource Corporation Method and apparatus for evaluating fraud risk in an electronic commerce transaction
US10565643B2 (en) 2002-05-30 2020-02-18 Consumerinfo.Com, Inc. Systems and methods of presenting simulated credit score information
US7386506B2 (en) * 2003-05-07 2008-06-10 Intelligent Wave Inc. Fraud score calculating program, method of calculating fraud score, and fraud score calculating system for credit cards
US20040225520A1 (en) * 2003-05-07 2004-11-11 Intelligent Wave, Inc. Fraud score calculating program, method of calculating fraud score, and fraud score calculating system for credit cards
US7458508B1 (en) 2003-05-12 2008-12-02 Id Analytics, Inc. System and method for identity-based fraud detection
US10521857B1 (en) 2003-05-12 2019-12-31 Symantec Corporation System and method for identity-based fraud detection
US8386377B1 (en) 2003-05-12 2013-02-26 Id Analytics, Inc. System and method for credit scoring using an identity network connectivity
US7562814B1 (en) 2003-05-12 2009-07-21 Id Analytics, Inc. System and method for identity-based fraud detection through graph anomaly detection
US7686214B1 (en) 2003-05-12 2010-03-30 Id Analytics, Inc. System and method for identity-based fraud detection using a plurality of historical identity records
US7793835B1 (en) 2003-05-12 2010-09-14 Id Analytics, Inc. System and method for identity-based fraud detection for transactions using a plurality of historical identity records
US11657411B1 (en) 2004-06-30 2023-05-23 Experian Marketing Solutions, Llc System, method, software and data structure for independent prediction of attitudinal and message responsiveness, and preferences for communication media, channel, timing, frequency, and sequences of communications, using an integrated data repository
US10810605B2 (en) 2004-06-30 2020-10-20 Experian Marketing Solutions, Llc System, method, software and data structure for independent prediction of attitudinal and message responsiveness, and preferences for communication media, channel, timing, frequency, and sequences of communications, using an integrated data repository
US10586279B1 (en) 2004-09-22 2020-03-10 Experian Information Solutions, Inc. Automated analysis of data to generate prospect notifications based on trigger events
US11861756B1 (en) 2004-09-22 2024-01-02 Experian Information Solutions, Inc. Automated analysis of data to generate prospect notifications based on trigger events
US11373261B1 (en) 2004-09-22 2022-06-28 Experian Information Solutions, Inc. Automated analysis of data to generate prospect notifications based on trigger events
US11562457B2 (en) 2004-09-22 2023-01-24 Experian Information Solutions, Inc. Automated analysis of data to generate prospect notifications based on trigger events
US20060149674A1 (en) * 2004-12-30 2006-07-06 Mike Cook System and method for identity-based fraud detection for transactions using a plurality of historical identity records
US20070043577A1 (en) * 2005-08-16 2007-02-22 Sheldon Kasower Apparatus and method of enabling a victim of identity theft to resolve and prevent fraud
US11743214B2 (en) 2005-09-14 2023-08-29 Liveperson, Inc. System and method for performing follow up based on user interactions
US8738732B2 (en) 2005-09-14 2014-05-27 Liveperson, Inc. System and method for performing follow up based on user interactions
US9590930B2 (en) 2005-09-14 2017-03-07 Liveperson, Inc. System and method for performing follow up based on user interactions
US11526253B2 (en) 2005-09-14 2022-12-13 Liveperson, Inc. System and method for design and dynamic generation of a web page
US10191622B2 (en) 2005-09-14 2019-01-29 Liveperson, Inc. System and method for design and dynamic generation of a web page
US9525745B2 (en) 2005-09-14 2016-12-20 Liveperson, Inc. System and method for performing follow up based on user interactions
US9432468B2 (en) 2005-09-14 2016-08-30 Liveperson, Inc. System and method for design and dynamic generation of a web page
US9948582B2 (en) 2005-09-14 2018-04-17 Liveperson, Inc. System and method for performing follow up based on user interactions
US11394670B2 (en) 2005-09-14 2022-07-19 Liveperson, Inc. System and method for performing follow up based on user interactions
US11157997B2 (en) 2006-03-10 2021-10-26 Experian Information Solutions, Inc. Systems and methods for analyzing data
US20080091530A1 (en) * 2006-04-28 2008-04-17 Rockne Egnatios Methods and systems for providing cross-selling with online banking environments
US8160957B2 (en) 2006-04-28 2012-04-17 Efunds Corporation Methods and systems for opening and funding a financial account online
US20080091593A1 (en) * 2006-04-28 2008-04-17 Rockne Egnatios Methods and systems for opening and funding a financial account online
US20080091591A1 (en) * 2006-04-28 2008-04-17 Rockne Egnatios Methods and systems for opening and funding a financial account online
US7849003B2 (en) 2006-04-28 2010-12-07 Efunds Corporation Methods and systems for opening and funding a financial account online
US20110047168A1 (en) * 2006-05-31 2011-02-24 Ellingsworth Martin E Method and system for classifying documents
US8255347B2 (en) 2006-05-31 2012-08-28 Hartford Fire Insurance Company Method and system for classifying documents
US7849030B2 (en) 2006-05-31 2010-12-07 Hartford Fire Insurance Company Method and system for classifying documents
US20070282824A1 (en) * 2006-05-31 2007-12-06 Ellingsworth Martin E Method and system for classifying documents
US8738552B2 (en) 2006-05-31 2014-05-27 Hartford Fire Insurance Company Method and system for classifying documents
US8275577B2 (en) 2006-09-19 2012-09-25 Smartsignal Corporation Kernel-based method for detecting boiler tube leaks
US11954731B2 (en) 2006-10-05 2024-04-09 Experian Information Solutions, Inc. System and method for generating a finance attribute from tradeline data
US11631129B1 (en) 2006-10-05 2023-04-18 Experian Information Solutions, Inc System and method for generating a finance attribute from tradeline data
US10963961B1 (en) 2006-10-05 2021-03-30 Experian Information Solutions, Inc. System and method for generating a finance attribute from tradeline data
US9563916B1 (en) 2006-10-05 2017-02-07 Experian Information Solutions, Inc. System and method for generating a finance attribute from tradeline data
US10121194B1 (en) 2006-10-05 2018-11-06 Experian Information Solutions, Inc. System and method for generating a finance attribute from tradeline data
US10204141B1 (en) 2006-11-28 2019-02-12 Lmb Mortgage Services, Inc. System and method of removing duplicate leads
US9110916B1 (en) 2006-11-28 2015-08-18 Lower My Bills, Inc. System and method of removing duplicate leads
US11106677B2 (en) 2006-11-28 2021-08-31 Lmb Mortgage Services, Inc. System and method of removing duplicate user records
US10255610B1 (en) 2006-12-04 2019-04-09 Lmb Mortgage Services, Inc. System and method of enhancing leads
US8214262B1 (en) 2006-12-04 2012-07-03 Lower My Bills, Inc. System and method of enhancing leads
US10977675B2 (en) 2006-12-04 2021-04-13 Lmb Mortgage Services, Inc. System and method of enhancing leads
US8311774B2 (en) 2006-12-15 2012-11-13 Smartsignal Corporation Robust distance measures for on-line monitoring
US10891691B2 (en) 2007-01-31 2021-01-12 Experian Information Solutions, Inc. System and method for providing an aggregation tool
US11176570B1 (en) 2007-01-31 2021-11-16 Experian Information Solutions, Inc. Systems and methods for providing a direct marketing campaign planning environment
US11803873B1 (en) 2007-01-31 2023-10-31 Experian Information Solutions, Inc. Systems and methods for providing a direct marketing campaign planning environment
US10311466B1 (en) 2007-01-31 2019-06-04 Experian Information Solutions, Inc. Systems and methods for providing a direct marketing campaign planning environment
US9916596B1 (en) 2007-01-31 2018-03-13 Experian Information Solutions, Inc. Systems and methods for providing a direct marketing campaign planning environment
US11443373B2 (en) 2007-01-31 2022-09-13 Experian Information Solutions, Inc. System and method for providing an aggregation tool
US10402901B2 (en) 2007-01-31 2019-09-03 Experian Information Solutions, Inc. System and method for providing an aggregation tool
US8606626B1 (en) 2007-01-31 2013-12-10 Experian Information Solutions, Inc. Systems and methods for providing a direct marketing campaign planning environment
US9508092B1 (en) 2007-01-31 2016-11-29 Experian Information Solutions, Inc. Systems and methods for providing a direct marketing campaign planning environment
US10078868B1 (en) 2007-01-31 2018-09-18 Experian Information Solutions, Inc. System and method for providing an aggregation tool
US11908005B2 (en) 2007-01-31 2024-02-20 Experian Information Solutions, Inc. System and method for providing an aggregation tool
US10692105B1 (en) 2007-01-31 2020-06-23 Experian Information Solutions, Inc. Systems and methods for providing a direct marketing campaign planning environment
US10650449B2 (en) 2007-01-31 2020-05-12 Experian Information Solutions, Inc. System and method for providing an aggregation tool
US7975299B1 (en) 2007-04-05 2011-07-05 Consumerinfo.Com, Inc. Child identity monitor
US7841004B1 (en) 2007-04-05 2010-11-23 Consumerinfo.Com, Inc. Child identity monitor
US20080294540A1 (en) * 2007-05-25 2008-11-27 Celka Christopher J System and method for automated detection of never-pay data sets
US8364588B2 (en) 2007-05-25 2013-01-29 Experian Information Solutions, Inc. System and method for automated detection of never-pay data sets
US9251541B2 (en) 2007-05-25 2016-02-02 Experian Information Solutions, Inc. System and method for automated detection of never-pay data sets
US11954089B2 (en) 2007-09-27 2024-04-09 Experian Information Solutions, Inc. Database system for triggering event notifications based on updates to database records
US11347715B2 (en) 2007-09-27 2022-05-31 Experian Information Solutions, Inc. Database system for triggering event notifications based on updates to database records
US10528545B1 (en) 2007-09-27 2020-01-07 Experian Information Solutions, Inc. Database system for triggering event notifications based on updates to database records
US11704693B2 (en) 2008-06-13 2023-07-18 Lmb Mortgage Services, Inc. System and method of generating existing customer leads
US10565617B2 (en) 2008-06-13 2020-02-18 Lmb Mortgage Services, Inc. System and method of generating existing customer leads
US10373198B1 (en) 2008-06-13 2019-08-06 Lmb Mortgage Services, Inc. System and method of generating existing customer leads
US7991689B1 (en) 2008-07-23 2011-08-02 Experian Information Solutions, Inc. Systems and methods for detecting bust out fraud using credit data
US8001042B1 (en) 2008-07-23 2011-08-16 Experian Information Solutions, Inc. Systems and methods for detecting bust out fraud using credit data
US9396436B2 (en) 2008-07-25 2016-07-19 Liveperson, Inc. Method and system for providing targeted content to a surfer
US11263548B2 (en) 2008-07-25 2022-03-01 Liveperson, Inc. Method and system for creating a predictive model for targeting web-page to a surfer
US8762313B2 (en) 2008-07-25 2014-06-24 Liveperson, Inc. Method and system for creating a predictive model for targeting web-page to a surfer
US8799200B2 (en) 2008-07-25 2014-08-05 Liveperson, Inc. Method and system for creating a predictive model for targeting webpage to a surfer
US9396295B2 (en) 2008-07-25 2016-07-19 Liveperson, Inc. Method and system for creating a predictive model for targeting web-page to a surfer
US9336487B2 (en) 2008-07-25 2016-05-10 Live Person, Inc. Method and system for creating a predictive model for targeting webpage to a surfer
US9104970B2 (en) 2008-07-25 2015-08-11 Liveperson, Inc. Method and system for creating a predictive model for targeting web-page to a surfer
US11763200B2 (en) 2008-07-25 2023-09-19 Liveperson, Inc. Method and system for creating a predictive model for targeting web-page to a surfer
US8954539B2 (en) 2008-07-25 2015-02-10 Liveperson, Inc. Method and system for providing targeted content to a surfer
US11386106B2 (en) 2008-08-04 2022-07-12 Liveperson, Inc. System and methods for searching and communication
US10891299B2 (en) 2008-08-04 2021-01-12 Liveperson, Inc. System and methods for searching and communication
US8805844B2 (en) 2008-08-04 2014-08-12 Liveperson, Inc. Expert search
US9569537B2 (en) 2008-08-04 2017-02-14 Liveperson, Inc. System and method for facilitating interactions
US9558276B2 (en) 2008-08-04 2017-01-31 Liveperson, Inc. Systems and methods for facilitating participation
US9563707B2 (en) 2008-08-04 2017-02-07 Liveperson, Inc. System and methods for searching and communication
US9582579B2 (en) 2008-08-04 2017-02-28 Liveperson, Inc. System and method for facilitating communication
US9892417B2 (en) 2008-10-29 2018-02-13 Liveperson, Inc. System and method for applying tracing tools for network locations
US10867307B2 (en) 2008-10-29 2020-12-15 Liveperson, Inc. System and method for applying tracing tools for network locations
US11562380B2 (en) 2008-10-29 2023-01-24 Liveperson, Inc. System and method for applying tracing tools for network locations
US11978114B1 (en) 2009-01-06 2024-05-07 Consumerinfo.Com, Inc. Report existence monitoring
US10937090B1 (en) 2009-01-06 2021-03-02 Consumerinfo.Com, Inc. Report existence monitoring
US20100174638A1 (en) * 2009-01-06 2010-07-08 ConsumerInfo.com Report existence monitoring
US11449850B2 (en) 2009-01-28 2022-09-20 Validsoft Limited Card false-positive prevention
US20120023022A1 (en) * 2009-03-20 2012-01-26 Pat Carroll Card-present security system
US9595051B2 (en) 2009-05-11 2017-03-14 Experian Marketing Solutions, Inc. Systems and methods for providing anonymized user profile data
US20120271872A1 (en) * 2009-06-21 2012-10-25 Tom Chau Procedure for denoising dual-axis swallowing accelerometry signals
US8992446B2 (en) * 2009-06-21 2015-03-31 Holland Bloorview Kids Rehabilitation Hospital Procedure for denoising dual-axis swallowing accelerometry signals
US8620591B2 (en) 2010-01-14 2013-12-31 Venture Gain LLC Multivariate residual-based health index for human health monitoring
US20110172504A1 (en) * 2010-01-14 2011-07-14 Venture Gain LLC Multivariate Residual-Based Health Index for Human Health Monitoring
US9652802B1 (en) 2010-03-24 2017-05-16 Consumerinfo.Com, Inc. Indirect monitoring and reporting of a user's credit data
US10909617B2 (en) 2010-03-24 2021-02-02 Consumerinfo.Com, Inc. Indirect monitoring and reporting of a user's credit data
US11615161B2 (en) 2010-04-07 2023-03-28 Liveperson, Inc. System and method for dynamically enabling customized web content and applications
US9767212B2 (en) 2010-04-07 2017-09-19 Liveperson, Inc. System and method for dynamically enabling customized web content and applications
US11430009B2 (en) 2010-04-30 2022-08-30 Lmb Mortgage Services, Inc. System and method of optimizing matching of leads
US10453093B1 (en) 2010-04-30 2019-10-22 Lmb Mortgage Services, Inc. System and method of optimizing matching of leads
US8949249B2 (en) * 2010-06-15 2015-02-03 Sas Institute, Inc. Techniques to find percentiles in a distributed computing environment
US20110307475A1 (en) * 2010-06-15 2011-12-15 Sas Institute Inc. Techniques to find percentiles in a distributed computing environment
US9152727B1 (en) 2010-08-23 2015-10-06 Experian Marketing Solutions, Inc. Systems and methods for processing consumer information for targeted marketing applications
US10417704B2 (en) 2010-11-02 2019-09-17 Experian Technology Ltd. Systems and methods of assisted strategy design
US10038683B2 (en) 2010-12-14 2018-07-31 Liveperson, Inc. Authentication of service requests using a communications initiation feature
US10104020B2 (en) 2010-12-14 2018-10-16 Liveperson, Inc. Authentication of service requests initiated from a social networking site
US9350598B2 (en) 2010-12-14 2016-05-24 Liveperson, Inc. Authentication of service requests using a communications initiation feature
US11777877B2 (en) 2010-12-14 2023-10-03 Liveperson, Inc. Authentication of service requests initiated from a social networking site
US11050687B2 (en) 2010-12-14 2021-06-29 Liveperson, Inc. Authentication of service requests initiated from a social networking site
US8918465B2 (en) 2010-12-14 2014-12-23 Liveperson, Inc. Authentication of service requests initiated from a social networking site
US10593004B2 (en) 2011-02-18 2020-03-17 Csidentity Corporation System and methods for identifying compromised personally identifiable information on the internet
US11861691B1 (en) 2011-04-29 2024-01-02 Consumerinfo.Com, Inc. Exposing reporting cycle information
US8660980B2 (en) 2011-07-19 2014-02-25 Smartsignal Corporation Monitoring system using kernel regression modeling with pattern sequences
US8620853B2 (en) 2011-07-19 2013-12-31 Smartsignal Corporation Monitoring method using kernel regression modeling with pattern sequences
US9250625B2 (en) 2011-07-19 2016-02-02 Ge Intelligent Platforms, Inc. System of sequential kernel regression modeling for forecasting and prognostics
US9256224B2 (en) 2011-07-19 2016-02-09 GE Intelligent Platforms, Inc Method of sequential kernel regression modeling for forecasting and prognostics
US11568348B1 (en) 2011-10-31 2023-01-31 Consumerinfo.Com, Inc. Pre-data breach monitoring
US12045755B1 (en) 2011-10-31 2024-07-23 Consumerinfo.Com, Inc. Pre-data breach monitoring
US11030562B1 (en) 2011-10-31 2021-06-08 Consumerinfo.Com, Inc. Pre-data breach monitoring
US8943002B2 (en) 2012-02-10 2015-01-27 Liveperson, Inc. Analytics driven engagement
US8805941B2 (en) 2012-03-06 2014-08-12 Liveperson, Inc. Occasionally-connected computing interface
US11711329B2 (en) 2012-03-06 2023-07-25 Liveperson, Inc. Occasionally-connected computing interface
US11134038B2 (en) 2012-03-06 2021-09-28 Liveperson, Inc. Occasionally-connected computing interface
US10326719B2 (en) 2012-03-06 2019-06-18 Liveperson, Inc. Occasionally-connected computing interface
US9331969B2 (en) 2012-03-06 2016-05-03 Liveperson, Inc. Occasionally-connected computing interface
US11323428B2 (en) 2012-04-18 2022-05-03 Liveperson, Inc. Authentication of service requests using a communications initiation feature
US11689519B2 (en) 2012-04-18 2023-06-27 Liveperson, Inc. Authentication of service requests using a communications initiation feature
US10666633B2 (en) 2012-04-18 2020-05-26 Liveperson, Inc. Authentication of service requests using a communications initiation feature
US11868591B2 (en) 2012-04-26 2024-01-09 Liveperson, Inc. Dynamic user interface customization
US9563336B2 (en) 2012-04-26 2017-02-07 Liveperson, Inc. Dynamic user interface customization
US10795548B2 (en) 2012-04-26 2020-10-06 Liveperson, Inc. Dynamic user interface customization
US11269498B2 (en) 2012-04-26 2022-03-08 Liveperson, Inc. Dynamic user interface customization
US11004119B2 (en) 2012-05-15 2021-05-11 Liveperson, Inc. Methods and systems for presenting specialized content using campaign metrics
US9672196B2 (en) 2012-05-15 2017-06-06 Liveperson, Inc. Methods and systems for presenting specialized content using campaign metrics
US11687981B2 (en) 2012-05-15 2023-06-27 Liveperson, Inc. Methods and systems for presenting specialized content using campaign metrics
US8918891B2 (en) 2012-06-12 2014-12-23 Id Analytics, Inc. Identity manipulation detection system and method
US10255598B1 (en) 2012-12-06 2019-04-09 Consumerinfo.Com, Inc. Credit card account data extraction
US10592982B2 (en) 2013-03-14 2020-03-17 Csidentity Corporation System and method for identifying related credit inquiries
US11775979B1 (en) 2013-03-15 2023-10-03 Consumerinfo.Com, Inc. Adjustment of knowledge-based authentication
US9633322B1 (en) 2013-03-15 2017-04-25 Consumerinfo.Com, Inc. Adjustment of knowledge-based authentication
US10169761B1 (en) 2013-03-15 2019-01-01 ConsumerInfo.com Inc. Adjustment of knowledge-based authentication
US10740762B2 (en) 2013-03-15 2020-08-11 Consumerinfo.Com, Inc. Adjustment of knowledge-based authentication
US11288677B1 (en) 2013-03-15 2022-03-29 Consumerlnfo.com, Inc. Adjustment of knowledge-based authentication
US10580025B2 (en) 2013-11-15 2020-03-03 Experian Information Solutions, Inc. Micro-geographic aggregation system
US10102536B1 (en) 2013-11-15 2018-10-16 Experian Information Solutions, Inc. Micro-geographic aggregation system
US20150170149A1 (en) * 2013-12-18 2015-06-18 Verizon Patent And Licensing Inc. Financial authorization of an online transaction based on a location and an identifier of a user device
US9892402B2 (en) * 2013-12-18 2018-02-13 Verizon Patent And Licensing Inc. Financial authorization of an online transaction based on a location and an identifier of a user device
US11107158B1 (en) 2014-02-14 2021-08-31 Experian Information Solutions, Inc. Automatic generation of code for attributes
US10262362B1 (en) 2014-02-14 2019-04-16 Experian Information Solutions, Inc. Automatic generation of code for attributes
US11847693B1 (en) 2014-02-14 2023-12-19 Experian Information Solutions, Inc. Automatic generation of code for attributes
US11176475B1 (en) 2014-03-11 2021-11-16 Applied Underwriters, Inc. Artificial intelligence system for training a classifier
US11809434B1 (en) 2014-03-11 2023-11-07 Applied Underwriters, Inc. Semantic analysis system for ranking search results
US12079829B2 (en) 2014-03-31 2024-09-03 Liveperson, Inc. Online behavioral predictor
US11386442B2 (en) 2014-03-31 2022-07-12 Liveperson, Inc. Online behavioral predictor
US10936629B2 (en) 2014-05-07 2021-03-02 Consumerinfo.Com, Inc. Keeping up with the joneses
US10019508B1 (en) 2014-05-07 2018-07-10 Consumerinfo.Com, Inc. Keeping up with the joneses
US11620314B1 (en) 2014-05-07 2023-04-04 Consumerinfo.Com, Inc. User rating based on comparing groups
US9576030B1 (en) 2014-05-07 2017-02-21 Consumerinfo.Com, Inc. Keeping up with the joneses
US20150339769A1 (en) * 2014-05-22 2015-11-26 C1 Bank System and method for enforcing data integrity and loan approval automation by means of data aggregation and analysis
US11620677B1 (en) 2014-06-25 2023-04-04 Experian Information Solutions, Inc. Mobile device sighting location analytics and profiling system
US11257117B1 (en) 2014-06-25 2022-02-22 Experian Information Solutions, Inc. Mobile device sighting location analytics and profiling system
US11720527B2 (en) 2014-10-17 2023-08-08 Zestfinance, Inc. API for implementing scoring functions
US12099470B2 (en) 2014-10-17 2024-09-24 Zestfinance, Inc. API for implementing scoring functions
US10339527B1 (en) 2014-10-31 2019-07-02 Experian Information Solutions, Inc. System and architecture for electronic fraud detection
US11941635B1 (en) 2014-10-31 2024-03-26 Experian Information Solutions, Inc. System and architecture for electronic fraud detection
US10990979B1 (en) 2014-10-31 2021-04-27 Experian Information Solutions, Inc. System and architecture for electronic fraud detection
US11436606B1 (en) 2014-10-31 2022-09-06 Experian Information Solutions, Inc. System and architecture for electronic fraud detection
US11010345B1 (en) 2014-12-19 2021-05-18 Experian Information Solutions, Inc. User behavior segmentation using latent topic detection
US10242019B1 (en) 2014-12-19 2019-03-26 Experian Information Solutions, Inc. User behavior segmentation using latent topic detection
US10445152B1 (en) 2014-12-19 2019-10-15 Experian Information Solutions, Inc. Systems and methods for dynamic report generation based on automatic modeling of complex data structures
US10869253B2 (en) 2015-06-02 2020-12-15 Liveperson, Inc. Dynamic communication routing based on consistency weighting and routing rules
US11638195B2 (en) 2015-06-02 2023-04-25 Liveperson, Inc. Dynamic communication routing based on consistency weighting and routing rules
US11151468B1 (en) 2015-07-02 2021-10-19 Experian Information Solutions, Inc. Behavior analysis using distributed representations of event data
US12099940B1 (en) 2015-07-02 2024-09-24 Experian Information Solutions, Inc. Behavior analysis using distributed representations of event data
US11410230B1 (en) 2015-11-17 2022-08-09 Consumerinfo.Com, Inc. Realtime access and control of secure regulated data
US11893635B1 (en) 2015-11-17 2024-02-06 Consumerinfo.Com, Inc. Realtime access and control of secure regulated data
US9767309B1 (en) 2015-11-23 2017-09-19 Experian Information Solutions, Inc. Access control system for implementing access restrictions of regulated database records while identifying and providing indicators of regulated database records matching validation criteria
US10685133B1 (en) 2015-11-23 2020-06-16 Experian Information Solutions, Inc. Access control system for implementing access restrictions of regulated database records while identifying and providing indicators of regulated database records matching validation criteria
US11748503B1 (en) 2015-11-23 2023-09-05 Experian Information Solutions, Inc. Access control system for implementing access restrictions of regulated database records while identifying and providing indicators of regulated database records matching validation criteria
US10019593B1 (en) 2015-11-23 2018-07-10 Experian Information Solutions, Inc. Access control system for implementing access restrictions of regulated database records while identifying and providing indicators of regulated database records matching validation criteria
US11729230B1 (en) 2015-11-24 2023-08-15 Experian Information Solutions, Inc. Real-time event-based notification system
US11159593B1 (en) 2015-11-24 2021-10-26 Experian Information Solutions, Inc. Real-time event-based notification system
US10757154B1 (en) 2015-11-24 2020-08-25 Experian Information Solutions, Inc. Real-time event-based notification system
US10278065B2 (en) 2016-08-14 2019-04-30 Liveperson, Inc. Systems and methods for real-time remote control of mobile applications
US10678894B2 (en) 2016-08-24 2020-06-09 Experian Information Solutions, Inc. Disambiguation and authentication of device users
US11550886B2 (en) 2016-08-24 2023-01-10 Experian Information Solutions, Inc. Disambiguation and authentication of device users
US20240061849A1 (en) * 2016-12-02 2024-02-22 Fair Isaac Corporation Fast automatic explanation of scored observations
US11748360B2 (en) 2016-12-02 2023-09-05 Fair Isaac Corporation Fast automatic explanation of scored observations
US20180157661A1 (en) * 2016-12-02 2018-06-07 Fair Isaac Corporation Fast automatic explanation of scored observations
US11042551B2 (en) 2016-12-02 2021-06-22 Fair Isaac Corporation Fast automatic explanation of scored observations
US11227001B2 (en) 2017-01-31 2022-01-18 Experian Information Solutions, Inc. Massive scale heterogeneous data ingestion and user resolution
US11681733B2 (en) 2017-01-31 2023-06-20 Experian Information Solutions, Inc. Massive scale heterogeneous data ingestion and user resolution
US11941650B2 (en) 2017-08-02 2024-03-26 Zestfinance, Inc. Explainable machine learning financial credit approval model for protected classes of borrowers
US10699028B1 (en) 2017-09-28 2020-06-30 Csidentity Corporation Identity security architecture systems and methods
US11580259B1 (en) 2017-09-28 2023-02-14 Csidentity Corporation Identity security architecture systems and methods
US11157650B1 (en) 2017-09-28 2021-10-26 Csidentity Corporation Identity security architecture systems and methods
US10896472B1 (en) 2017-11-14 2021-01-19 Csidentity Corporation Security and identity verification system and architecture
US10733668B2 (en) 2018-01-30 2020-08-04 PointPredictive Inc. Multi-layer machine learning classifier
US10692141B2 (en) 2018-01-30 2020-06-23 PointPredictive Inc. Multi-layer machine learning classifier with correlative score
US10586280B2 (en) 2018-01-30 2020-03-10 PointPredictive Inc. Risk-based machine learning classsifier
US11321774B2 (en) 2018-01-30 2022-05-03 Pointpredictive, Inc. Risk-based machine learning classifier
US12131375B2 (en) 2018-01-30 2024-10-29 Pointpredictive, Inc. Risk-based machine learning classifier
US11960981B2 (en) 2018-03-09 2024-04-16 Zestfinance, Inc. Systems and methods for providing machine learning model evaluation by using decomposition
US11847574B2 (en) 2018-05-04 2023-12-19 Zestfinance, Inc. Systems and methods for enriching modeling tools and infrastructure with semantics
US10671749B2 (en) 2018-09-05 2020-06-02 Consumerinfo.Com, Inc. Authenticated access and aggregation database platform
US11265324B2 (en) 2018-09-05 2022-03-01 Consumerinfo.Com, Inc. User permissions for access to secure data at third-party
US12074876B2 (en) 2018-09-05 2024-08-27 Consumerinfo.Com, Inc. Authenticated access and aggregation database platform
US10880313B2 (en) 2018-09-05 2020-12-29 Consumerinfo.Com, Inc. Database platform for realtime updating of user data from third party sources
US11399029B2 (en) 2018-09-05 2022-07-26 Consumerinfo.Com, Inc. Database platform for realtime updating of user data from third party sources
US12131241B2 (en) 2019-02-15 2024-10-29 Zestfinance, Inc. Systems and methods for decomposition of differentiable and non-differentiable models
US11816541B2 (en) 2019-02-15 2023-11-14 Zestfinance, Inc. Systems and methods for decomposition of differentiable and non-differentiable models
US11893466B2 (en) 2019-03-18 2024-02-06 Zestfinance, Inc. Systems and methods for model fairness
US10846295B1 (en) 2019-08-08 2020-11-24 Applied Underwriters, Inc. Semantic analysis system for ranking search results
US11682041B1 (en) 2020-01-13 2023-06-20 Experian Marketing Solutions, Llc Systems and methods of a tracking analytics platform
US12002094B2 (en) 2020-11-24 2024-06-04 Zestfinance, Inc. Systems and methods for generating gradient-boosted models with improved fairness
US11720962B2 (en) 2020-11-24 2023-08-08 Zestfinance, Inc. Systems and methods for generating gradient-boosted models with improved fairness

Similar Documents

Publication Publication Date Title
US5745654A (en) Fast explanations of scored observations
US5197004A (en) Method and apparatus for automatic categorization of applicants from resumes
US7756804B2 (en) Automated model building and evaluation for data mining system
Dimitras et al. Business failure prediction using rough sets
US6988090B2 (en) Prediction analysis apparatus and program storage medium therefor
RU2510891C2 (en) Method and device for system used for forecasting of group trade
CN109739844B (en) Data classification method based on attenuation weight
US7512553B2 (en) System for automated part-number mapping
US5659731A (en) Method for rating a match for a given entity found in a list of entities
KR101462707B1 (en) Methods and systems for implementing approximate string matching within a database
US6542896B1 (en) System and method for organizing data
US9916584B2 (en) Method and system for automatic assignment of sales opportunities to human agents
AU2008203532A1 (en) Method and System for Processing Information
EP1043666A2 (en) A system for identification of selectively related database records
Gupta et al. Machine Learning based Credit Card Fraud Detection-A Review
WO2016016973A1 (en) Result evaluation device, control method for result evaluation device, and control program for result evaluation device
Pristyanto et al. The effect of feature selection on classification algorithms in credit approval
CN112506930B (en) Data insight system based on machine learning technology
Dorsey et al. Bankruptcy prediction using artificial neural systems
WO1992017853A2 (en) Direct data base analysis, forecasting and diagnosis method
Díaz et al. Some experiences applying fuzzy logic to economics
Karnila et al. MARKET BASKET ANALYSIS ON TRANSACTION DATA USING THE APRIORI ALGORITHM
Leung et al. Building a scorecard in practice
Chen Applications of fuzzy logic in data mining process
Phyo et al. Efficient K-Means Clustering Algorithm for Predicting of Students’ Academic Performance

Legal Events

Date Code Title Description
AS Assignment

Owner name: HNC SOFTWARE, INC., CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:TITAN, HARI;REEL/FRAME:007899/0244

Effective date: 19960417

STCF Information on status: patent grant

Free format text: PATENTED CASE

FPAY Fee payment

Year of fee payment: 4

AS Assignment

Owner name: FAIR ISAAC CORPORATION, CALIFORNIA

Free format text: MERGER;ASSIGNOR:HNC SOFTWARE, INC.;REEL/FRAME:014506/0405

Effective date: 20021031

FPAY Fee payment

Year of fee payment: 8

FEPP Fee payment procedure

Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

FPAY Fee payment

Year of fee payment: 12