Stars
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.
GitHub Action for continuous benchmarking to keep performance
Distributed query engine providing simple and reliable data processing for any modality and scale
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (…
🎓 Path to a free self-taught education in Computer Science!
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
A portable Pythonic Data Lakehouse powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to your ML and analytics workloads.
Snowflake is a network service for generating unique ID numbers at high scale with some simple guarantees.
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Modin: Scale your Pandas workflows by changing a single line of code
The most popular open source electronic health records and medical practice management solution.
🌐 Front End interview preparation materials for busy engineers (updated for 2025)
Roadmap to becoming an Artificial Intelligence Expert in 2022
Lab Materials for MIT 6.S191: Introduction to Deep Learning
JavaScript API for face detection and face recognition in the browser and nodejs with tensorflow.js
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
A collection of design patterns/idioms in Python
HTTP load testing tool and library. It's over 9000!
React Dropdown component
React typeahead with Bootstrap styling
Homework and projects for Northwestern Data Science Bootcamp
A collection of examples, tips and tricks and snippets of scripting for the Jenkins Pipeline plugin