GitHub - yinhuagang/shifu: Legacy 0.2.x

yinhuagang/shifu

 
 

Getting Started

Please visit shifu.ml for download information, installation instructions, and tutorials.

What is Shifu?

Shifu is an open-source, end-to-end machine learning and data mining framework built on top of Hadoop. It is designed for data scientists and simplifies the life cycle of building machine learning models. Although originally built for fraud modeling, Shifu has been generalized to many other modeling domains.

Shifu provides a simple command-line interface for each step of the model building process, including:

  • Statistics calculation & variable selection to determine the most predictive variables in your data
  • Variable normalization
  • Distributed neural network model training
  • Post-training analysis & model evaluation
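
To make the variable normalization step more concrete, here is a minimal Java sketch of one common approach: z-score scaling with clipping. It is an illustration under stated assumptions, not Shifu's actual implementation. The class name `ZScoreSketch`, the method `normalize`, and the `cutoff` parameter are hypothetical and do not come from the Shifu code base.

```java
import java.util.Arrays;

// Illustrative only: a generic z-score normalizer, not Shifu's actual code.
// All names here (ZScoreSketch, normalize, cutoff) are hypothetical.
class ZScoreSketch {

    // Scales values to zero mean and unit variance (population variance),
    // then clips each z-score to [-cutoff, cutoff] to limit outlier influence.
    static double[] normalize(double[] values, double cutoff) {
        if (values.length == 0) {
            return new double[0];
        }
        double mean = Arrays.stream(values).average().orElse(0.0);
        double variance = Arrays.stream(values)
                .map(v -> (v - mean) * (v - mean))
                .average()
                .orElse(0.0);
        double stdDev = Math.sqrt(variance);

        double[] result = new double[values.length];
        for (int i = 0; i < values.length; i++) {
            // A constant column has zero spread; map it to 0 instead of dividing by zero.
            double z = stdDev == 0.0 ? 0.0 : (values[i] - mean) / stdDev;
            result[i] = Math.max(-cutoff, Math.min(cutoff, z));
        }
        return result;
    }

    public static void main(String[] args) {
        // Made-up sample column with one large outlier.
        double[] amounts = {10.0, 20.0, 30.0, 40.0, 1000.0};
        System.out.println(Arrays.toString(normalize(amounts, 1.5)));
    }
}
```

Clipping keeps a single extreme value from dominating the inputs to a neural network. Whether Shifu 0.2.x normalizes this way is not stated here, so treat the sketch as background rather than a description of the tool.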

Shifu’s fast, Hadoop-based distributed neural network training can reduce model training time from days to hours on 500 GB data sets. Shifu integrates with Pig workflows on Hadoop, and Shifu-trained models can be integrated into production code with a simple Java API. Shifu leverages Pig, Akka, Encog, and other open-source projects.

Contributors

  • Zhanghao Hu
  • Grahame Jastrebski
  • Lavar Li
  • Mark Liu
  • David Zhang
  • Xin Zhong

Copyright and License

Copyright 2012-2014, eBay Software Foundation. Released under the Apache License.

About

Legacy 0.2.x

Languages

  • Java 98.3%
  • Other 1.7%