[go: up one dir, main page]
More Web Proxy on the site http://driver.im/

CN110264342A - A kind of business audit method and device based on machine learning - Google Patents

A kind of business audit method and device based on machine learning Download PDF

Info

Publication number
CN110264342A
CN110264342A CN201910533825.2A CN201910533825A CN110264342A CN 110264342 A CN110264342 A CN 110264342A CN 201910533825 A CN201910533825 A CN 201910533825A CN 110264342 A CN110264342 A CN 110264342A
Authority
CN
China
Prior art keywords
random forest
forest model
decision tree
audit
classification results
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910533825.2A
Other languages
Chinese (zh)
Other versions
CN110264342B (en
Inventor
刘冰
徐为恺
杨杨
江旻
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
WeBank Co Ltd
Original Assignee
WeBank Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by WeBank Co Ltd filed Critical WeBank Co Ltd
Priority to CN201910533825.2A priority Critical patent/CN110264342B/en
Publication of CN110264342A publication Critical patent/CN110264342A/en
Application granted granted Critical
Publication of CN110264342B publication Critical patent/CN110264342B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/004Artificial life, i.e. computing arrangements simulating life
    • G06N3/006Artificial life, i.e. computing arrangements simulating life based on simulated virtual individual or collective life forms, e.g. social simulations or particle swarm optimisation [PSO]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/03Credit; Loans; Processing thereof

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Finance (AREA)
  • Accounting & Taxation (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Biophysics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Biomedical Technology (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Development Economics (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • Technology Law (AREA)
  • General Business, Economics & Management (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The business audit method and device based on machine learning that the embodiment of the invention provides a kind of, it is related to financial technology technical field, this method comprises: using history service audit data training Random Forest model, make the audit behavior of Random Forest model study auditor, then by every decision tree in the feature vector input Random Forest model of business applicant, obtain the classification results of every decision tree output in Random Forest model, the corresponding weight of every decision tree in the classification results and Random Forest model exported later further according to every decision tree in Random Forest model, determine the auditing result of service request.For expert model, Random Forest model is obtained based on history service audit data training, is not merely the experience of professional auditor, therefore rely on artificial experience small, the influence for reducing subjective factor improves the generalization ability and versatility of audit model.

Description

A kind of business audit method and device based on machine learning
Technical field
The present embodiments relate to financial technology (Fintech) technical fields more particularly to a kind of based on machine learning Business audit method and device.
Background technique
With the development of computer technology, and more and more technical applications (such as: artificial intelligence, cloud computing, block chain Deng) in financial field, traditional financial industry is gradually changing to financial technology (Fintech), but due to the safety of financial industry Property, requirement of real-time, also to technology propose higher requirement.In financial industry, supply chain loan solves upstream and downstream enterprise The difficult problem of industry financing difficulties, guarantee, and by getting through upstream and downstream financing bottleneck, supply chain finance costs can also be reduced, Improve the competitiveness of core enterprise and mating enterprise.Currently, mainly loan is audited using expert model, it is more by collecting It is engaged in the experience of the business personnel of loan audit year, by summarizing, is summarized as the service logic rule of a set of fixation, passes through Regulation engine is deployed in auditing system, realizes automatic audit, and this method excessively relies on the experience of business personnel, subjective factor By force.
Summary of the invention
Due to excessively relying on the experience of business personnel using the scheme of expert model audit business, subjective factor is strong to be asked Topic, the business audit method and device based on machine learning that the embodiment of the invention provides a kind of.
On the one hand, the business audit method based on machine learning that the embodiment of the invention provides a kind of, comprising:
The service request of acquisition business applicant;
The feature vector of the business applicant is extracted according to the service request;
By every decision tree in the feature vector input Random Forest model of the business applicant, obtain described random The classification results that every decision tree exports in forest model, the Random Forest model are with history service audit data for training What sample training obtained;
According to the classification results that every decision tree in the Random Forest model exports, the audit of the service request is determined As a result.
Optionally, the classification results exported according to every decision tree in the Random Forest model, determine the industry The auditing result of business request, comprising:
It is every in the classification results and the Random Forest model exported according to every decision tree in the Random Forest model The corresponding weight of decision tree, determines the auditing result of the service request.
Optionally, described according to the classification results of every decision tree output in the Random Forest model and described random gloomy The corresponding weight of every decision tree in woods model, determines the auditing result of the service request, comprising:
The weight of the identical decision tree of classification results in the Random Forest model is added, determines each classification results Classified weight;
Using the maximum classification results of classified weight as auditing result.
Optionally, the service request is supply chain service request, and the history service audit data include Lian Shu enterprise Characteristic, core enterprise's characteristic, auditor history audit logging.
Optionally, the Random Forest model is obtained by training sample training of history service audit data, comprising:
It obtains history service and audits data;
Data, which are audited, according to the history service determines feature vector set;
N number of subcharacter vector set is extracted from described eigenvector set, the N is default positive integer;
N decision tree is obtained using N number of subcharacter vector set training;
N decision tree is formed into Random Forest model.
On the one hand, the embodiment of the invention provides a kind of business audit device based on machine learning, comprising:
Module is obtained, for obtaining the service request of business applicant;
Extraction module, for extracting the feature vector of the business applicant according to the service request;
Categorization module, for every decision in the feature vector input Random Forest model by the business applicant Tree obtains the classification results that every decision tree exports in the Random Forest model, and the Random Forest model is with history industry Business audit data are that training sample training obtains;
Processing module, for according in the Random Forest model every decision tree export classification results, determine described in The auditing result of service request.
Optionally, the processing module is specifically used for:
It is every in the classification results and the Random Forest model exported according to every decision tree in the Random Forest model The corresponding weight of decision tree, determines the auditing result of the service request.
Optionally, the processing module is specifically used for:
The weight of the identical decision tree of classification results in the Random Forest model is added, determines each classification results Classified weight;
Using the maximum classification results of classified weight as auditing result.
Optionally, the service request is supply chain service request, and the history service audit data include Lian Shu enterprise Characteristic, core enterprise's characteristic, auditor history audit logging.
Optionally, the categorization module is specifically used for:
It obtains history service and audits data;
Data, which are audited, according to the history service determines feature vector set;
N number of subcharacter vector set is extracted from described eigenvector set, the N is default positive integer;
N decision tree is obtained using N number of subcharacter vector set training;
N decision tree is formed into Random Forest model.
On the one hand, the embodiment of the invention provides a kind of computer equipment, including memory, processor and it is stored in storage On device and the computer program that can run on a processor, the processor are realized when executing described program based on machine learning The step of business audit method.
On the one hand, the embodiment of the invention provides a kind of computer readable storage medium, being stored with can be set by computer The standby computer program executed, when described program is run on a computing device, so that computer equipment execution is based on The step of business audit method of machine learning.
In the embodiment of the present invention, due to making random forest mould using history service audit data training Random Forest model Type learns the audit behavior of auditor, then by every decision in the feature vector input Random Forest model of business applicant Tree obtains the classification results of every decision tree output in Random Forest model, later certainly further according to every in Random Forest model The classification results of plan tree output, determine the auditing result of service request.For expert model, Random Forest model is base It is obtained in history service audit data training, is not merely the experience of professional auditor, therefore small to the dependence of artificial experience, The influence for reducing subjective factor improves the generalization ability and versatility of audit model.
Detailed description of the invention
To describe the technical solutions in the embodiments of the present invention more clearly, make required in being described below to embodiment Attached drawing is briefly introduced, it should be apparent that, drawings in the following description are only some embodiments of the invention, for this For the those of ordinary skill in field, without any creative labor, it can also be obtained according to these attached drawings His attached drawing.
Fig. 1 is a kind of application scenarios schematic diagram provided in an embodiment of the present invention;
Fig. 2 is a kind of flow diagram of the business audit method based on machine learning provided in an embodiment of the present invention;
Fig. 3 is a kind of flow diagram of the method for trained Random Forest model provided in an embodiment of the present invention;
Fig. 4 is a kind of flow diagram of the business audit method based on machine learning provided in an embodiment of the present invention;
Fig. 5 is a kind of structural schematic diagram of the business audit device based on machine learning provided in an embodiment of the present invention;
Fig. 6 is a kind of structural schematic diagram of computer equipment provided in an embodiment of the present invention.
Specific embodiment
In order to which the purpose of the present invention, technical solution and beneficial effect is more clearly understood, below in conjunction with attached drawing and implementation Example, the present invention will be described in further detail.It should be appreciated that specific embodiment described herein is only used to explain this hair It is bright, it is not intended to limit the present invention.
In order to facilitate understanding, noun involved in the embodiment of the present invention is explained below.
Supply chain loan: Supply Chain Finance, be in supply chain core enterprise and its it is relevant up and down You Lianshu enterprise as a whole, formulates according to the transaction relationship of enterprise in supply chain and industry characteristic based on goods power and cash A kind of Financing Mode of the whole finance resolution of flow control.
Core enterprise: in supply chain grasp core technology, core competence, core link enterprise.
Lian Shu enterprise: the upstream and downstream Lian Shu enterprise of core enterprise in supply chain.
The business audit method based on machine learning in the embodiment of the present invention can be applied to applied field as shown in Figure 1 Scape includes terminal device 101, audit server 102 in the application scenarios, wherein terminal device 101 can be intelligent hand Machine, tablet computer or portable personal computer etc..Audit server 102 can be the business audit of bank and other financial mechanism Server.User submits service request on terminal device 101, and service request is sent to audit server by terminal device 101 102.Auditing includes the trained Random Forest model for business audit in server 102.Server 102 is audited according to institute Service request extracts the feature vector of business applicant, will be every in the feature vector input Random Forest model of business applicant Decision tree obtains the classification results of every decision tree output in Random Forest model, certainly according to every in Random Forest model The classification results of plan tree output, determine the auditing result of service request.Auditing result is sent to terminal and set by audit server 102 Standby 101, user can check the auditing result of service request from terminal device 101.
Based on application scenario diagram shown in FIG. 1, the embodiment of the invention provides a kind of business audits based on machine learning The process of the process of method, this method can be executed by the business audit device based on machine learning, the industry based on machine learning Business audit device can be the audit server 102 in Fig. 1, as shown in Figure 2, comprising the following steps:
Step S201 obtains the service request of business applicant.
Step S202 extracts the feature vector of business applicant according to service request.
Business applicant can be personal or enterprise, and service request can be loan requests, such as personal loan request, enterprise Industry loan requests, supply chain loan requests etc..
When business applicant is personal, the feature vector of business applicant can be personal characteristics, such as personal basic Information, personal reference record, personal business record, personal asset etc..When business applicant is enterprise, the spy of business applicant Sign vector can be enterprise characteristic, such as enterprise's reference record, enterprise qualification etc..
Step S203, by every decision tree in the feature vector of business applicant input Random Forest model, obtain with The classification results that every decision tree exports in machine forest model.
Random Forest model is obtained by training sample training of history service audit data.When Random Forest model is used When auditing personal service request, history service audit data include the history audit logging of personal characteristics data, auditor.When Random Forest model is for when auditing supply chain service request, it to include chain category enterprise characteristic data, core that history service, which audits data, Heart enterprise characteristic data, auditor history audit logging.
Step S204 determines examining for service request according to the classification results that every decision tree in Random Forest model exports Core result.
In a kind of possible embodiment, the classification results of decision tree output include auditing to pass through and audit not passing through, When the classification results exported in Random Forest model are that the decision tree quantity that audit passes through is greater than the unacceptable decision tree number of audit When amount, determine that the auditing result of service request passes through for audit.When the classification results exported in Random Forest model are that audit is logical When the decision tree quantity crossed is less than audit unacceptable decision tree quantity, it is obstructed to audit to determine the auditing result of service request It crosses.
In a kind of possible embodiment, according to the classification results of every decision tree output in Random Forest model and with The corresponding weight of every decision tree in machine forest model, determines the auditing result of service request.Specifically, each decision tree is corresponding Weight be to be determined according to the importance of feature vector in decision tree, it is that the corresponding weight of all decision trees is added and be 1.
In specific implementation, the weight of the identical decision tree of classification results in Random Forest model can be added, be determined every The classified weight of a classification results, using the maximum classification results of classified weight as auditing result.
Illustratively, the classification results for setting decision tree include auditing to pass through and audit not passing through, are careful by classification results The weight for the decision tree that core passes through is added, and obtains the classified weight that audit passes through.It is to audit unacceptable decision by classification results The weight of tree is added, and is obtained and is audited unacceptable classified weight.When the classified weight that audit passes through is greater than unacceptable point of audit It when class weight, determines that auditing result passes through for audit, audits unacceptable classified weight when the classified weight that audit passes through is less than When, determine that auditing result does not pass through for audit.
Due to making examining for Random Forest model study auditor using history service audit data training Random Forest model Core behavior obtains random forest then by every decision tree in the feature vector input Random Forest model of business applicant The classification results that every decision tree exports in model, the classification knot exported later further according to every decision tree in Random Forest model Fruit determines the auditing result of service request.For expert model, Random Forest model is to audit number based on history service It is obtained according to training, is not merely the experience of professional auditor, therefore is small to the dependence of artificial experience, reduce subjective factor It influences, improves the generalization ability and versatility of audit model.
The process that Random Forest model is obtained using history service audit data training is described below, as shown in figure 3, including Following steps:
Step S301 obtains history service and audits data.
History service audit data include chain category enterprise characteristic data, core enterprise's characteristic, the history of auditor examine Core record, wherein chain category enterprise characteristic data include Lian Shu enterprise essential information (for example, establish with limited laibility the time, scope of the enterprise, Headcount, main business industry etc.), chain category enterprise operation financial report (for example, debt ratio, amount of liabilities, the external guaranty amount of money, The turnover, operating profit etc.), pledged the accounts receivable amount of money, pledged accounts receivable stroke count, accounts receivable characteristic (ratio Such as accounts receivable account phase).Core enterprise's characteristic includes core enterprise's essential information (for example, establishing time, enterprise with limited laibility Whether scale headcount, main business industry, is listed company etc.), that core enterprise manages financial report, recent public sentiment is great negative Face news quantity, accounts receivable characteristic (such as accounts receivable account phase etc.).
Step S302 audits data according to history service and determines feature vector set.
Specifically, after obtaining history service audit data, history service audit data are pre-processed.Pretreatment can be with Including following methods: mode one, since the history service of collection audits data, there may be mistake, exceptional value and missings Situation can do the operation such as tax default value, Rejection of samples to this kind of data, avoid this kind of data influence training result.Mode two, Standardization is done to specific characteristic column, such as the operating income of company is one-dimensional continuously distributed numerical data, can be done herein Section classification, such as 3,000,000 or less, 300 ten thousand to 1,000 ten thousand, 1,000 ten thousand to 5,000 ten thousand, 50,000,000 or more 4 section classifications, then It is marked.History service is audited data preparation for multidimensional data matrix, to use to training by mode three.
Step S303, extracts N number of subcharacter vector set from feature vector set, and N is default positive integer.
Specifically, N number of subcharacter vector set is extracted from feature vector set using bootstrap method.
Step S304 obtains N decision tree using the training of N number of subcharacter vector set, and N decision tree composition is random Forest model.
For each subcharacter vector set, decision is obtained using the feature vector training in the subcharacter vector set Tree, wherein decision tree can be CART (classification returns) tree.According to the importance of the feature vector in subcharacter vector set, The weight of decision tree is set, and the sum of weight of N decision tree is 1.
Using history service audit data training Random Forest model, while being arranged at random according to the importance of feature vector The weight of every decision tree in forest model, therefore when using Random Forest model audit service request, in conjunction with point of decision tree Class result and the weight of decision tree can effectively provide the accuracy of auditing result.
Embodiment in order to preferably explain the present invention is provided a loan using supply chain below and describes present invention implementation as implement scene A kind of business audit method based on machine learning that example provides, this method are held by the business audit device based on machine learning Row, as shown in figure 4, method includes the following steps:
Setting bank A has more loan transaction in supply chain financial field, and the data warehouse of operation system has accumulated confession Answer the loan enterprises application record and loan audit historical data of chain business.From business system data, row in risk data and Feature vector set Dt is extracted in people's row collage-credit data.N number of son is extracted from feature vector set Dt using bootstrap method Feature vector set { D1, D2 ..., DN }.N decision tree is obtained using the training of N number of subcharacter vector set, respectively T1, T2 ..., TN, according to the importance of the feature vector in subcharacter vector set, the weight that N decision tree is arranged is respectively { a1, a2 ..., a N }, the sum of weight of N decision tree are that 1, N decision tree forms Random Forest model.
A property finishing has occurred between C real estate company, core enterprise in supply chain and Lian Shu enterprise B decoration corporation Business, needs C real estate company after standby service to pay B decoration corporation by 5,000,000 yuan of the business amount of money, and B decoration corporation needs at this time It wants business funds to have enough to meet the need, passes through the accounts receivable of this business trade background, Xiang Yinhang A application loan three and one-half million member.Bank A After the loan requests for receiving B decoration corporation, Cong Renhang reference platform inquires the reference record of B decoration corporation, including history is borrowed Money registration record, pledge record, assets pledge record, enterprise's external guaranty record etc., then from enterprise register in Administration for Industry and Commerce data Source, obtain C real estate company and B decoration corporation enterprise characteristic data, such as scope of the enterprise, set up the time, registered enterprise fund, Manage financial report, accounts receivable characteristic etc..The feature of C real estate company is extracted from the data being related to above for this loan Feature vector is separately input in N decision tree by vector, and every decision tree exports a classification results, wherein classification results Pass through and audit including audit and does not pass through.The weight that classification results are the decision tree that audit passes through is added, audit is obtained and passes through Classified weight.It is the weight addition for auditing unacceptable decision tree by classification results, obtains and audit unacceptable classified weight. When the classified weight that audit passes through, which is greater than, audits unacceptable classified weight, determines that auditing result passes through for audit, work as audit By classified weight be less than audit unacceptable classified weight when, determine auditing result for audit do not pass through.
Due to making examining for Random Forest model study auditor using history service audit data training Random Forest model Core behavior obtains random forest then by every decision tree in the feature vector input Random Forest model of business applicant The classification results that every decision tree exports in model, the classification knot exported later further according to every decision tree in Random Forest model Fruit determines the auditing result of service request.For expert model, Random Forest model is to audit number based on history service It is obtained according to training, is not merely the experience of professional auditor, therefore is small to the dependence of artificial experience, reduce subjective factor It influences, improves the generalization ability and versatility of audit model.
Based on the same technical idea, the embodiment of the invention provides a kind of business audit device based on machine learning, As shown in figure 5, the device 500 includes:
Module 501 is obtained, for obtaining the service request of business applicant;
Extraction module 502, for extracting the feature vector of the business applicant according to the service request;
Categorization module 503, certainly for every in the feature vector input Random Forest model by the business applicant Plan tree obtains the classification results that every decision tree exports in the Random Forest model, and the Random Forest model is with history Business audit data are that training sample training obtains;
Processing module 504, the classification results for being exported according to every decision tree in the Random Forest model, determines institute State the auditing result of service request.
Optionally, the processing module 504 is specifically used for:
It is every in the classification results and the Random Forest model exported according to every decision tree in the Random Forest model The corresponding weight of decision tree, determines the auditing result of the service request.
Optionally, the processing module 504 is specifically used for:
The weight of the identical decision tree of classification results in the Random Forest model is added, determines each classification results Classified weight;
Using the maximum classification results of classified weight as auditing result.
Optionally, the service request is supply chain service request, and the history service audit data include Lian Shu enterprise Characteristic, core enterprise's characteristic, auditor history audit logging.
Optionally, the categorization module 503 is specifically used for:
It obtains history service and audits data;
Data, which are audited, according to the history service determines feature vector set;
N number of subcharacter vector set is extracted from described eigenvector set, the N is default positive integer;
N decision tree is obtained using N number of subcharacter vector set training;
N decision tree is formed into Random Forest model.
Based on the same technical idea, the embodiment of the invention provides a kind of computer equipments, as shown in fig. 6, including extremely Lack a processor 601, and the memory 602 connecting at least one processor, does not limit processing in the embodiment of the present invention Specific connection medium between device 601 and memory 602 passes through bus between processor 601 and memory 602 in Fig. 6 and connects For.Bus can be divided into address bus, data/address bus, control bus etc..
In embodiments of the present invention, memory 602 is stored with the instruction that can be executed by least one processor 601, at least The instruction that one processor 601 is stored by executing memory 602, can execute the business audit above-mentioned based on machine learning Included step in method.
Wherein, processor 601 is the control centre of computer equipment, can use various interfaces and connection computer The various pieces of equipment are stored in memory 602 by running or executing the instruction being stored in memory 602 and calling Data, to carry out business audit.Optionally, processor 601 may include one or more processing units, and processor 601 can Integrated application processor and modem processor, wherein the main processing operation system of application processor, user interface and application Program etc., modem processor mainly handle wireless communication.It is understood that above-mentioned modem processor can not also It is integrated into processor 601.In some embodiments, processor 601 and memory 602 can be realized on the same chip, In some embodiments, they can also be realized respectively on independent chip.
Processor 601 can be general processor, such as central processing unit (CPU), digital signal processor, dedicated integrated Circuit (Application Specific Integrated Circuit, ASIC), field programmable gate array or other can Perhaps transistor logic, discrete hardware components may be implemented or execute present invention implementation for programmed logic device, discrete gate Each method, step and logic diagram disclosed in example.General processor can be microprocessor or any conventional processor Deng.The step of method in conjunction with disclosed in the embodiment of the present invention, can be embodied directly in hardware processor and execute completion, Huo Zheyong Hardware and software module combination in processor execute completion.
Memory 602 is used as a kind of non-volatile computer readable storage medium storing program for executing, can be used for storing non-volatile software journey Sequence, non-volatile computer executable program and module.Memory 602 may include the storage medium of at least one type, It such as may include flash memory, hard disk, multimedia card, card-type memory, random access storage device (Random Access Memory, RAM), static random-access memory (Static Random Access Memory, SRAM), may be programmed read-only deposit Reservoir (Programmable Read Only Memory, PROM), read-only memory (Read Only Memory, ROM), band Electrically erasable programmable read-only memory (Electrically Erasable Programmable Read-Only Memory, EEPROM), magnetic storage, disk, CD etc..Memory 602 can be used for carrying or storing have instruction or data The desired program code of structure type and can by any other medium of computer access, but not limited to this.The present invention is real Applying the memory 602 in example can also be circuit or other devices that arbitrarily can be realized store function, for storing program Instruction and/or data.
Based on the same technical idea, it the embodiment of the invention provides a kind of computer readable storage medium, is stored with The computer program that can be executed by computer equipment, when described program is run on a computing device, so that the computer Equipment executes the step of business audit method based on machine learning.
It should be understood by those skilled in the art that, the embodiment of the present invention can provide as method or computer program product. Therefore, complete hardware embodiment, complete software embodiment or embodiment combining software and hardware aspects can be used in the present invention Form.It is deposited moreover, the present invention can be used to can be used in the computer that one or more wherein includes computer usable program code The shape for the computer program product implemented on storage media (including but not limited to magnetic disk storage, CD-ROM, optical memory etc.) Formula.
The present invention be referring to according to the method for the embodiment of the present invention, the process of equipment (system) and computer program product Figure and/or block diagram describe.It should be understood that every one stream in flowchart and/or the block diagram can be realized by computer program instructions The combination of process and/or box in journey and/or box and flowchart and/or the block diagram.It can provide these computer programs Instruct the processor of general purpose computer, special purpose computer, Embedded Processor or other programmable data processing devices to produce A raw machine, so that being generated by the instruction that computer or the processor of other programmable data processing devices execute for real The device for the function of being specified in present one or more flows of the flowchart and/or one or more blocks of the block diagram.
These computer program instructions, which may also be stored in, is able to guide computer or other programmable data processing devices with spy Determine in the computer-readable memory that mode works, so that it includes referring to that instruction stored in the computer readable memory, which generates, Enable the manufacture of device, the command device realize in one box of one or more flows of the flowchart and/or block diagram or The function of being specified in multiple boxes.
These computer program instructions also can be loaded onto a computer or other programmable data processing device, so that counting Series of operation steps are executed on calculation machine or other programmable devices to generate computer implemented processing, thus in computer or The instruction executed on other programmable devices is provided for realizing in one or more flows of the flowchart and/or block diagram one The step of function of being specified in a box or multiple boxes.
Although preferred embodiments of the present invention have been described, it is created once a person skilled in the art knows basic Property concept, then additional changes and modifications may be made to these embodiments.So it includes excellent that the following claims are intended to be interpreted as It selects embodiment and falls into all change and modification of the scope of the invention.
Obviously, various changes and modifications can be made to the invention without departing from essence of the invention by those skilled in the art Mind and range.In this way, if these modifications and changes of the present invention belongs to the range of the claims in the present invention and its equivalent technologies Within, then the present invention is also intended to include these modifications and variations.

Claims (10)

1. a kind of business audit method based on machine learning characterized by comprising
The service request of acquisition business applicant;
The feature vector of the business applicant is extracted according to the service request;
By every decision tree in the feature vector input Random Forest model of the business applicant, the random forest is obtained The classification results that every decision tree exports in model, the Random Forest model are using history service audit data as training sample What training obtained;
According to the classification results that every decision tree in the Random Forest model exports, the audit knot of the service request is determined Fruit.
2. the method as described in claim 1, which is characterized in that described defeated according to every decision tree in the Random Forest model Classification results out determine the auditing result of the service request, comprising:
It determines for every in the classification results and the Random Forest model exported according to every decision tree in the Random Forest model The corresponding weight of plan tree, determines the auditing result of the service request.
3. method according to claim 2, which is characterized in that described defeated according to every decision tree in the Random Forest model The corresponding weight of every decision tree in classification results and the Random Forest model out, determines the audit knot of the service request Fruit, comprising:
The weight of the identical decision tree of classification results in the Random Forest model is added, determines the classification of each classification results Weight;
Using the maximum classification results of classified weight as auditing result.
4. the method as described in claims 1 to 3 is any, which is characterized in that the service request is supply chain service request, institute State history service audit data include chain category enterprise characteristic data, core enterprise's characteristic, auditor history audit logging.
5. method as claimed in claim 4, which is characterized in that the Random Forest model is to be with history service audit data Training sample training obtains, comprising:
It obtains history service and audits data;
Data, which are audited, according to the history service determines feature vector set;
N number of subcharacter vector set is extracted from described eigenvector set, the N is default positive integer;
N decision tree is obtained using N number of subcharacter vector set training;
N decision tree is formed into Random Forest model.
6. a kind of business audit device based on machine learning characterized by comprising
Module is obtained, for obtaining the service request of business applicant;
Extraction module, for extracting the feature vector of the business applicant according to the service request;
Categorization module is obtained for every decision tree in the feature vector input Random Forest model by the business applicant The classification results of every decision tree output in the Random Forest model are obtained, the Random Forest model is audited with history service Data are that training sample training obtains;
Processing module, the classification results for being exported according to every decision tree in the Random Forest model, determines the business The auditing result of request.
7. device as claimed in claim 6, which is characterized in that the processing module is specifically used for:
It determines for every in the classification results and the Random Forest model exported according to every decision tree in the Random Forest model The corresponding weight of plan tree, determines the auditing result of the service request.
8. device as claimed in claim 7, which is characterized in that the processing module is specifically used for:
The weight of the identical decision tree of classification results in the Random Forest model is added, determines the classification of each classification results Weight;
Using the maximum classification results of classified weight as auditing result.
9. a kind of computer equipment including memory, processor and stores the meter that can be run on a memory and on a processor Calculation machine program, which is characterized in that the processor is realized described in Claims 1 to 5 any claim when executing described program The step of method.
10. a kind of computer readable storage medium, which is characterized in that it is stored with the computer journey that can be executed by computer equipment Sequence, when described program is run on a computing device, so that computer equipment perform claim requirement 1~5 is any described The step of method.
CN201910533825.2A 2019-06-19 2019-06-19 Business auditing method and device based on machine learning Active CN110264342B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910533825.2A CN110264342B (en) 2019-06-19 2019-06-19 Business auditing method and device based on machine learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910533825.2A CN110264342B (en) 2019-06-19 2019-06-19 Business auditing method and device based on machine learning

Publications (2)

Publication Number Publication Date
CN110264342A true CN110264342A (en) 2019-09-20
CN110264342B CN110264342B (en) 2024-06-28

Family

ID=67919576

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910533825.2A Active CN110264342B (en) 2019-06-19 2019-06-19 Business auditing method and device based on machine learning

Country Status (1)

Country Link
CN (1) CN110264342B (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111612166A (en) * 2020-04-23 2020-09-01 中国科学院计算机网络信息中心 Reimbursement time prediction method based on machine learning
CN112258135A (en) * 2020-05-15 2021-01-22 北京沃东天骏信息技术有限公司 Method and device for auditing prescription data and computer-readable storage medium
CN112308466A (en) * 2020-11-26 2021-02-02 东莞市盟大塑化科技有限公司 Enterprise qualification auditing method and device, computer equipment and storage medium
CN112579579A (en) * 2019-09-30 2021-03-30 北京国双科技有限公司 Material mobile data auditing method and device, storage medium and electronic equipment
CN112734352A (en) * 2019-10-28 2021-04-30 北京京东尚科信息技术有限公司 Document auditing method and device based on data dimensionality
CN113159175A (en) * 2021-04-21 2021-07-23 平安科技(深圳)有限公司 Data prediction method, device, equipment and storage medium
CN113435842A (en) * 2021-06-28 2021-09-24 京东科技控股股份有限公司 Business process processing method and computer equipment
CN113705668A (en) * 2021-08-27 2021-11-26 创新奇智(广州)科技有限公司 Method, device, equipment and storage medium for detecting working state of component
CN114202399A (en) * 2021-12-13 2022-03-18 金蝶软件(中国)有限公司 Intelligent approval method and related device
CN114496196A (en) * 2022-02-18 2022-05-13 潍坊医学院附属医院 Automatic auditing system for clinical biochemical inspection in medical laboratory
WO2024021555A1 (en) * 2022-07-29 2024-02-01 京东科技信息技术有限公司 Resource examination and approval method and device, and random forest model training method and device

Citations (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8818910B1 (en) * 2013-11-26 2014-08-26 Comrise, Inc. Systems and methods for prioritizing job candidates using a decision-tree forest algorithm
CN105279691A (en) * 2014-07-25 2016-01-27 中国银联股份有限公司 Financial transaction detection method and equipment based on random forest model
CN107092827A (en) * 2017-03-30 2017-08-25 中国民航大学 A kind of Android malware detection method based on improvement forest algorithm
CN107766883A (en) * 2017-10-13 2018-03-06 华中师范大学 A kind of optimization random forest classification method and system based on weighted decision tree
CN108510507A (en) * 2018-03-27 2018-09-07 哈尔滨理工大学 A kind of 3D vertebra CT image active profile dividing methods of diffusion-weighted random forest
CN108665159A (en) * 2018-05-09 2018-10-16 深圳壹账通智能科技有限公司 A kind of methods of risk assessment, device, terminal device and storage medium
CN109145965A (en) * 2018-08-02 2019-01-04 深圳辉煌耀强科技有限公司 Cell recognition method and device based on random forest disaggregated model
CN109214914A (en) * 2018-08-24 2019-01-15 厦门集微科技有限公司 A kind of loan information checking method and device based on communication open platform
CN109359669A (en) * 2018-09-10 2019-02-19 平安科技(深圳)有限公司 Method for detecting abnormality, device, computer equipment and storage medium are submitted an expense account in medical insurance
CN109389490A (en) * 2018-09-26 2019-02-26 深圳壹账通智能科技有限公司 Loan product matching process, device, computer equipment and storage medium
CN109460872A (en) * 2018-11-14 2019-03-12 重庆邮电大学 One kind being lost unbalanced data prediction technique towards mobile communication subscriber
CN109564677A (en) * 2018-11-09 2019-04-02 香港应用科技研究院有限公司 Super-resolution synthesis system and method based on random forest grader weighted results
CN109726826A (en) * 2018-12-19 2019-05-07 东软集团股份有限公司 Training method, device, storage medium and the electronic equipment of random forest
CN109829471A (en) * 2018-12-19 2019-05-31 东软集团股份有限公司 Training method, device, storage medium and the electronic equipment of random forest

Patent Citations (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8818910B1 (en) * 2013-11-26 2014-08-26 Comrise, Inc. Systems and methods for prioritizing job candidates using a decision-tree forest algorithm
CN105279691A (en) * 2014-07-25 2016-01-27 中国银联股份有限公司 Financial transaction detection method and equipment based on random forest model
CN107092827A (en) * 2017-03-30 2017-08-25 中国民航大学 A kind of Android malware detection method based on improvement forest algorithm
CN107766883A (en) * 2017-10-13 2018-03-06 华中师范大学 A kind of optimization random forest classification method and system based on weighted decision tree
CN108510507A (en) * 2018-03-27 2018-09-07 哈尔滨理工大学 A kind of 3D vertebra CT image active profile dividing methods of diffusion-weighted random forest
CN108665159A (en) * 2018-05-09 2018-10-16 深圳壹账通智能科技有限公司 A kind of methods of risk assessment, device, terminal device and storage medium
CN109145965A (en) * 2018-08-02 2019-01-04 深圳辉煌耀强科技有限公司 Cell recognition method and device based on random forest disaggregated model
CN109214914A (en) * 2018-08-24 2019-01-15 厦门集微科技有限公司 A kind of loan information checking method and device based on communication open platform
CN109359669A (en) * 2018-09-10 2019-02-19 平安科技(深圳)有限公司 Method for detecting abnormality, device, computer equipment and storage medium are submitted an expense account in medical insurance
CN109389490A (en) * 2018-09-26 2019-02-26 深圳壹账通智能科技有限公司 Loan product matching process, device, computer equipment and storage medium
CN109564677A (en) * 2018-11-09 2019-04-02 香港应用科技研究院有限公司 Super-resolution synthesis system and method based on random forest grader weighted results
CN109460872A (en) * 2018-11-14 2019-03-12 重庆邮电大学 One kind being lost unbalanced data prediction technique towards mobile communication subscriber
CN109726826A (en) * 2018-12-19 2019-05-07 东软集团股份有限公司 Training method, device, storage medium and the electronic equipment of random forest
CN109829471A (en) * 2018-12-19 2019-05-31 东软集团股份有限公司 Training method, device, storage medium and the electronic equipment of random forest

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112579579A (en) * 2019-09-30 2021-03-30 北京国双科技有限公司 Material mobile data auditing method and device, storage medium and electronic equipment
CN112734352A (en) * 2019-10-28 2021-04-30 北京京东尚科信息技术有限公司 Document auditing method and device based on data dimensionality
CN111612166B (en) * 2020-04-23 2022-10-25 中国科学院计算机网络信息中心 Reimbursement time prediction method based on machine learning
CN111612166A (en) * 2020-04-23 2020-09-01 中国科学院计算机网络信息中心 Reimbursement time prediction method based on machine learning
CN112258135A (en) * 2020-05-15 2021-01-22 北京沃东天骏信息技术有限公司 Method and device for auditing prescription data and computer-readable storage medium
CN112308466A (en) * 2020-11-26 2021-02-02 东莞市盟大塑化科技有限公司 Enterprise qualification auditing method and device, computer equipment and storage medium
CN113159175A (en) * 2021-04-21 2021-07-23 平安科技(深圳)有限公司 Data prediction method, device, equipment and storage medium
CN113159175B (en) * 2021-04-21 2023-06-06 平安科技(深圳)有限公司 Data prediction method, device, equipment and storage medium
CN113435842A (en) * 2021-06-28 2021-09-24 京东科技控股股份有限公司 Business process processing method and computer equipment
CN113705668A (en) * 2021-08-27 2021-11-26 创新奇智(广州)科技有限公司 Method, device, equipment and storage medium for detecting working state of component
CN114202399A (en) * 2021-12-13 2022-03-18 金蝶软件(中国)有限公司 Intelligent approval method and related device
CN114496196A (en) * 2022-02-18 2022-05-13 潍坊医学院附属医院 Automatic auditing system for clinical biochemical inspection in medical laboratory
WO2024021555A1 (en) * 2022-07-29 2024-02-01 京东科技信息技术有限公司 Resource examination and approval method and device, and random forest model training method and device

Also Published As

Publication number Publication date
CN110264342B (en) 2024-06-28

Similar Documents

Publication Publication Date Title
CN110264342A (en) A kind of business audit method and device based on machine learning
Guo et al. Blockchain application and outlook in the banking industry
CN110458693A (en) A kind of automatic measures and procedures for the examination and approval of business loan, device, storage medium and electronic equipment
CN110276620A (en) It is a kind of to determine the method and device traded extremely
CN108665366A (en) Determine method, terminal device and the computer readable storage medium of consumer's risk grade
CN108230140A (en) The method and apparatus of pushed information, the method and apparatus for determining input default value
CN109740792A (en) Data predication method, system, terminal and computer storage medium
CN108629685A (en) loan product attribute determining method and server
Vasylieva et al. Scientific and methodical approaches to determining the center-orientation of financial conglomerates with the factor and cluster analysis
Pellegrino et al. Digital financing for SMEs' recovery in the post-COVID era: A bibliometric review
CN102915466A (en) Data processing method and device
US11798100B2 (en) Transaction counterpart identification
Ramzan et al. Impact of asset preferences on firm performance over its life cycle: Is agency theory or neo‐classical theory more relevant?
Biswas et al. Automated credit assessment framework using ETL process and machine learning
CN110276692B (en) Method and device for processing transaction data
Kaminskyi et al. Information technology model for customer relationship management of nonbank lenders: coupling profitability and risk
Bakoeva The Essence Of Transformation Of The Banking Sector In The Context Of The Implementation Of Modern Financial Technologies
Vovchak et al. Bank business modeling and levels of non-performing loans: Perspectives of international risk factors in Ukraine
Reddy et al. Peer to peer lending, default prediction-evidence from lending club
Egorova et al. Digital currency in the development of payment systems on the bitcoin platform
Topaloglou et al. Investors’ Behavior in Cryptocurrency Market
Reyes-Mercado et al. How fintech was born?
Liu et al. Research on the identification and prediction of default risk of online lending platform customers
CN109657897A (en) Mobility notch method of adjustment, device, equipment and storage medium
Chen et al. Risk and Return of Securities Investment in the Context of Big Data and Artificial Intelligence

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant