Building a production-ready feature pipeline in real-time

Tools

Apache Kafka
Python
Quix Streams

Aim

The ultimate goal is to build a trading bot that is powered by ML.

Before even thinking about how the ML model will do its thing, we'll need to design, develop and deploy a real-time feature pipeline that produces the features needed by the model both at training and at inference.

The pipeline has 3 parts:

Ingest raw data from an external service. Here that means raw trades, and the Kraken Websocket API will do.
Transform these trades into features for the ML model.
Save these features in a Feature Store, to be fetched by the ML model to generate both the training data and real-time predictions.
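
The transformation step could look roughly like the sketch below. The README does not say which features are computed, so this assumes OHLC candles over fixed one-minute windows; the trade fields (`product_id`, `price`, `volume`, `timestamp_ms`) and the `trades_to_candles` helper are hypothetical names chosen for illustration, not the repository's actual code.

```python
from dataclasses import dataclass


@dataclass
class Candle:
    product_id: str
    window_start_ms: int
    open: float
    high: float
    low: float
    close: float
    volume: float


def trades_to_candles(trades, window_ms=60_000):
    """Aggregate trades into OHLC candles over fixed (tumbling) windows."""
    candles = {}
    # Process trades in time order so open/close are the first/last price.
    for trade in sorted(trades, key=lambda t: t["timestamp_ms"]):
        # Round the timestamp down to the start of its window.
        window_start = trade["timestamp_ms"] - trade["timestamp_ms"] % window_ms
        key = (trade["product_id"], window_start)
        price, volume = trade["price"], trade["volume"]
        candle = candles.get(key)
        if candle is None:
            # First trade in this window sets open, high, low and close.
            candles[key] = Candle(
                trade["product_id"], window_start, price, price, price, price, volume
            )
        else:
            candle.high = max(candle.high, price)
            candle.low = min(candle.low, price)
            candle.close = price
            candle.volume += volume
    return [candles[k] for k in sorted(candles)]


# Hypothetical sample trades (assumed schema, not Kraken's wire format).
trades = [
    {"product_id": "BTC/USD", "price": 100.0, "volume": 1.0, "timestamp_ms": 1_000},
    {"product_id": "BTC/USD", "price": 105.0, "volume": 0.5, "timestamp_ms": 30_000},
    {"product_id": "BTC/USD", "price": 95.0, "volume": 2.0, "timestamp_ms": 59_000},
    {"product_id": "BTC/USD", "price": 98.0, "volume": 1.0, "timestamp_ms": 61_000},
]
candles = trades_to_candles(trades)
```

In a streaming setting the same aggregation would run incrementally per window instead of over a finished list, but the per-trade update logic stays the same.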

In a real-world setting, each of the above processes is implemented as a separate service, and communication between these services happens through a message broker like Kafka.
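
To make the decoupling concrete, here is a minimal sketch that wires three services together through named topics. It uses a toy in-memory class as a stand-in for Kafka so it runs without a broker; the `InMemoryBroker` class, the topic names `trades` and `features`, and the `notional` feature are all assumptions made for illustration.

```python
import json
from collections import defaultdict, deque


class InMemoryBroker:
    """Toy stand-in for a message broker: named topics holding byte messages."""

    def __init__(self):
        self.topics = defaultdict(deque)

    def produce(self, topic, value):
        # Brokers like Kafka carry opaque bytes, so serialize to JSON first.
        self.topics[topic].append(json.dumps(value).encode("utf-8"))

    def consume(self, topic):
        # Yield and remove every message currently waiting in the topic.
        queue = self.topics[topic]
        while queue:
            yield json.loads(queue.popleft().decode("utf-8"))


def ingestion_service(broker, raw_trades):
    # Service 1: push raw trades into the (hypothetical) "trades" topic.
    for trade in raw_trades:
        broker.produce("trades", trade)


def transformation_service(broker):
    # Service 2: read trades, compute a feature, write to "features".
    for trade in broker.consume("trades"):
        feature = {
            "product_id": trade["product_id"],
            "notional": trade["price"] * trade["volume"],
        }
        broker.produce("features", feature)


def feature_store_service(broker, store):
    # Service 3: persist features; a plain list stands in for the Feature Store.
    for feature in broker.consume("features"):
        store.append(feature)


broker = InMemoryBroker()
store = []
ingestion_service(broker, [{"product_id": "BTC/USD", "price": 100.0, "volume": 2.0}])
transformation_service(broker)
feature_store_service(broker, store)
```

Each service only knows the topic it reads from and the topic it writes to, which is what lets them be deployed, restarted and scaled independently.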

This way, your system becomes scalable: you can spin up more containers as needed and leverage Kafka consumer groups.
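
The idea behind consumer groups can be sketched as follows: a topic is split into partitions, and the partitions are divided among the consumers in a group, so adding containers spreads the load. This is a simplified illustration, not Kafka's actual implementation; `partition_for` uses CRC32 as a stand-in hash, and `assign_partitions` is a hypothetical round-robin assignment.

```python
import zlib


def partition_for(key, num_partitions):
    # Stand-in hash; Kafka's default partitioner uses a different hash function.
    return zlib.crc32(key.encode("utf-8")) % num_partitions


def assign_partitions(num_partitions, consumers):
    """Spread partitions over the consumers of one group, round-robin."""
    assignment = {consumer: [] for consumer in consumers}
    for partition in range(num_partitions):
        assignment[consumers[partition % len(consumers)]].append(partition)
    return assignment


# One container reads every partition of a 4-partition topic.
one = assign_partitions(4, ["worker-1"])
# A second container takes over half of the partitions.
two = assign_partitions(4, ["worker-1", "worker-2"])
# With more consumers than partitions, the extra consumer gets nothing.
five = assign_partitions(4, [f"worker-{i}" for i in range(1, 6)])
```

The last case shows the practical limit: the number of partitions caps how many consumers in a group can do useful work in parallel.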

Wanna learn more about ML?

→ Take a look 🤗

Folders and files

Transformation
injestion
.env.sample
.gitignore
Makefile
README.md
docker-compose.yml
quix.yaml

chaba-victor/Real-time-data-pipelin
