-
copado
- Ljubljana, Slovenia
- copado.com
- @canimus
- in/herminio-vazquez-701bb0
- https://orcid.org/0000-0003-1937-8006
-
python-workshop-fp Public
Functional programming workshop and exercises
-
cuallee Public
Possibly the fastest DataFrame-agnostic quality check library in town.
-
dagster-capstone Public
A sample project that uses the foundational concepts of Dagster orchestration
-
100dem Public
100 data engineering mistakes
Creative Commons Zero v1.0 Universal Updated Mar 9, 2024 -
aiowebhdfs Public
A modern, async implementation of the WebHDFS API in Python
-
google-cloud-container Public
A container ready to connect to Google Cloud Storage and BigQuery
Dockerfile Updated Jan 23, 2024 -
currency-daily Public
A temp project to download currencies in a daily job
-
mCIRCrna Public
Forked from u-brite/mCIRCrna
Integrating circadian transcriptomes with muscle snRNA-seq data
MIT License Updated Aug 5, 2022 -
filehole Public
Forked from vestalisvirginis/filehole
Python library to find missing files in a scheduled delivery.
Python MIT License Updated Jun 26, 2022 -
datafrake Public
A dataframe generator with fake data for unit testing your data pipelines
Python GNU General Public License v3.0 Updated Nov 21, 2021 -
-
azure-datafactory Public
A test repository for exploring Azure Data Factory capabilities for data pipelines
GNU General Public License v3.0 Updated May 8, 2021 -
job1 Public
A simple example of how to use Apache Spark in Python for streaming over sockets
GNU General Public License v3.0 Updated May 3, 2021 -
-
genomics Public
A Docker container with all libraries required for sequencing FASTA
-
typefu Public
A utility library to convert data types between data sources and different file formats
-
alphareader Public
A custom reader for delimited files in Python, able to ingest big data files.
-
luigi Public
Forked from spotify/luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
Python Apache License 2.0 Updated Nov 7, 2019 -
auto-eda Public
Forked from darenasc/auto-eda
Automated Exploratory Data Analysis. Simplifying Data Exploration
PLpgSQL Updated Oct 24, 2019 -
ProjectQ Public
Forked from huawei-quantum-challenges/ProjectQ
ProjectQ: An open source software framework for quantum computing
-
bioformats Public
A containerized version of the python-bioformats library to read .lif images with the javabridge setup done
Python Updated Mar 3, 2019 -
chronograf Public
Forked from influxdata/chronograf
Open source monitoring and visualization UI for the TICK stack
TypeScript Other Updated Nov 29, 2018 -
swarmpit Public
Forked from swarmpit/swarmpit
Lightweight Docker Swarm management UI
Clojure Eclipse Public License 1.0 Updated Sep 11, 2018 -
priceengine Public
A demonstration of how to kill flies with a shotgun
Python MIT License Updated Aug 24, 2018 -
pytalent Public
An AI-enabled app to scan curriculums and provide a visualization of skills
MIT License Updated Aug 14, 2018 -
nl-geojson Public
Forked from larsbouwens/nl-geojson
The Netherlands GeoJSON map
HTML Updated Jul 17, 2018 -