Define the Kiba Pro offering · Issue #20 · thbar/kiba · GitHub

Define the Kiba Pro offering #20

Closed
thbar opened this issue Jul 15, 2015 · 14 comments

Comments

@thbar
Owner
thbar commented Jul 15, 2015

As first mentioned on my blog, I plan to launch a Kiba Pro offering. Please comment here or email me to discuss details or to express your interest and support; this will be greatly appreciated.

Sustainable OSS

Especially after maintaining activewarehouse-etl for quite a bit of time, I want to find a sustainable way to maintain a high-quality ETL solution for Ruby for years to come, without killing my solopreneur business.

To paraphrase Mike Perham's statement about Sidekiq Pro (see the Sidekiq Pro FAQ): "Kiba Pro is (well, will be) an extension which adds a few nice features to Kiba. Kiba is open source and free for all to use but unfortunately it takes a lot of my time to maintain and support. Kiba Pro is a way for you to purchase really useful functionality while also ensuring Kiba will be supported for years to come".

A recurring yearly subscription similar to Sidekiq Pro would secure some of my time to maintain and improve the solution and avoid the fate of activewarehouse-etl.

Potential features

Features could include:

  • multi-threading (and later, multi-machine) - commonly requested
  • built-in sources/transforms/destinations for common tasks
    • lookups
    • upserts / bulk load modules
    • connectors optimized for parallelism (HTTP pagination extraction)
    • connectors for the cloud (Redshift, ...)
  • built-in helpers for common operations (debugging, limiting, caching...)
  • premium support

Interested? Please chime in!

I'm posting this publicly for transparency, and also to find the "good" feature set: the one that larger companies would be willing to pay for, supporting my work on Kiba while getting extra value.

Do not hesitate to email me if your company could be interested in supporting richer features.

@dhonig
dhonig commented Jul 15, 2015

Would this multi-threaded implementation make use of something like Celluloid?

@thbar
Owner Author
thbar commented Jul 15, 2015

@dhonig I have started several prototypes and I'm not sure yet what the first published implementation will rely on (it could be Celluloid, or just concurrent-ruby, or something even simpler, depending on my findings). I need to investigate further!

May I ask: would you prefer to see a Celluloid-based solution, or the opposite? Any insight is most welcome.

@dhonig
dhonig commented Jul 15, 2015

I'm familiar with the Celluloid idea of actors from Erlang and Akka... I think it could be a big win to use actors here. But I'm all for the simplest and best solution without introducing another heavyweight dependency. I'm not too familiar with concurrent-ruby, but after a quick glance I still think Celluloid actors might be a simpler way forward.

@thbar
Owner Author
thbar commented Jul 15, 2015

I do like actors too, especially since here it seems to make a lot of sense given how the processing occurs. I will keep experimenting with Celluloid indeed, and see if I get enough control. Thanks for those extra details, appreciated!
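
For readers following the threading discussion above, here is a minimal sketch of the "even simpler" end of the spectrum: a fixed pool of plain Ruby threads fed by a `Queue`, with no Celluloid or concurrent-ruby dependency. It is an illustration only, not the Kiba Pro implementation; `parallel_map_rows` is a hypothetical helper name, and the sketch assumes Ruby 2.3 or later for `Queue#close`.

```ruby
# Hypothetical helper, NOT part of Kiba or Kiba Pro.
# Applies the given block to each row using a fixed pool of threads,
# and returns the results in the same order as the input rows.
def parallel_map_rows(rows, workers: 4)
  input = Queue.new
  rows.each_with_index { |row, index| input << [index, row] }
  input.close # once closed and drained, `pop` returns nil (Ruby >= 2.3)

  results = Array.new(rows.size)
  lock = Mutex.new

  threads = Array.new(workers) do
    Thread.new do
      while (item = input.pop)
        index, row = item
        value = yield(row)
        # Guard the shared array so the sketch does not rely on MRI's GVL.
        lock.synchronize { results[index] = value }
      end
    end
  end

  threads.each(&:join) # re-raises any exception raised inside a worker
  results
end

rows = [{ id: 1 }, { id: 2 }, { id: 3 }]
enriched = parallel_map_rows(rows, workers: 2) do |row|
  row.merge(double: row[:id] * 2)
end
```

On MRI, threads like these mainly help with I/O-bound steps (HTTP calls, database lookups) because of the global interpreter lock; an actor library adds supervision and messaging on top of the same idea.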

@mroby-dresources

Any chance of automated rollback or versioning? You can roll your own, but it would be great to have a common way of handling it as part of the ETL process for frequently updated data.

@thbar
Owner Author
thbar commented Dec 3, 2015

@mroby-dresources can you email me (thibaut.barrere+kiba@gmail.com) so we can discuss this in more depth? Thanks!

@Daniel-ltw

https://github.com/grosser/parallel

This would make multi-process and multi-threaded execution easy.

@standingwave

Bumping this. Love the current features but looking at rolling some multi-threading support myself since it looks like progress has stalled on that work?

@thbar
Owner Author
thbar commented May 17, 2016

@standingwave work hasn't stalled actually! I now have an "alpha" quality multi-threaded kiba-pro runner which I'm currently packing up for a first release. Hit me up via email (thibaut.barrere+kiba@gmail.com) and I'll add you to the "alpha" as soon as it's ready (a bit of extra testing is required, and packaging as well, but I'm getting there).

thbar self-assigned this May 17, 2016
@thbar
Owner Author
thbar commented Nov 14, 2016

After several projects with clients, the first component will be a flexible & fast SQL insert destination, as I saw that this usually provides more leverage than multithreading. Closing this for now, and I'll update the README when the Pro bundle is available.
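
To illustrate why a bulk insert destination can bring that much leverage, here is a minimal sketch of the general batching idea. It is not the Kiba Pro component: `BatchingDestination` and its `batch_size:` and `flush:` parameters are hypothetical names. It only relies on the open-source Kiba convention that a destination responds to `write(row)` and, optionally, `close`.

```ruby
# Hypothetical destination, for illustration only (NOT the Kiba Pro component).
# Rows are buffered and handed to `flush` in batches, so a real job could
# issue one multi-row INSERT per batch instead of one statement per row.
class BatchingDestination
  def initialize(batch_size:, flush:)
    @batch_size = batch_size
    @flush = flush # any callable that accepts an Array of rows
    @buffer = []
  end

  # Called once per row during the run.
  def write(row)
    @buffer << row
    flush_buffer if @buffer.size >= @batch_size
  end

  # Called once at the end of the run; sends the last partial batch.
  def close
    flush_buffer unless @buffer.empty?
  end

  private

  def flush_buffer
    @flush.call(@buffer)
    @buffer = [] # start a fresh array so flushed batches are not mutated
  end
end

# Stand-in for a database: collect the batches in memory.
batches = []
destination = BatchingDestination.new(
  batch_size: 2,
  flush: ->(rows) { batches << rows }
)
[{ id: 1 }, { id: 2 }, { id: 3 }].each { |row| destination.write(row) }
destination.close
```

With three rows and a batch size of two, `flush` is called twice: once with the first two rows, then once from `close` with the remaining row. In a real job the callable would wrap the database driver of your choice.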

thbar closed this as completed Nov 14, 2016
@dhonig
dhonig commented Nov 14, 2016

@thbar The multithreaded version = Elixir, right?

@thbar
Owner Author
thbar commented Nov 14, 2016

@dhonig Ruby as well! While I do have plans to ship Elixir versions of Kiba Pro components, I also plan to release multithreading-enabled features in the Ruby version.

@dhonig
dhonig commented Nov 14, 2016

@thbar exciting! We have used Kiba on several projects now and an Elixir version seems like a great fit!

@thbar
Owner Author
thbar commented Nov 16, 2016

@dhonig glad you find Kiba useful :-) And yes, I'm coding more with Elixir on ETL related tasks, and will definitely publish something at some point around that!
