Stars
The repository contains one gpu kernel each day :)
OpenSeek aims to unite the global open source community to drive collaborative innovation in algorithms, data and systems to develop next-generation models that surpass DeepSeek.
Fine-tune LLMs for free with 100+ Notebooks on Google Colab, Kaggle, and more.
A tool to crawl GitHub repositories, extract their content, and generate structured datasets for training LLMs.
Ongoing research training transformer models at scale
A training framework for large-scale language models based on Megatron-Core, the COOM Training Framework is designed to efficiently handle extensive model training inspired by Deepseek's HAI-LLM op…
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
🚀✨ Minimalistic, powerful and extremely customizable Zsh prompt
marszall87 / lambda-pure
Forked from sindresorhus/purePretty, minimal and fast ZSH prompt, with NodeJS version
RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry
A Python PySpark Projet with Poetry
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
It is my belief that you, the postgraduate students and job-seekers for whom the book is primarily meant will benefit from reading it; however, it is my hope that even the most experienced research…
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.
For extensive instructor led learning
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.
https://huyenchip.com/ml-interviews-book/
Code Repository for The Kaggle Book, Published by Packt Publishing
Approaching (Almost) Any Machine Learning Problem
A repository listing out the potential sources which will help you in preparing for a Data Science/Machine Learning interview. New resources added frequently.
This repository is to prepare for Machine Learning interviews.
A repo for data science related questions and answers
Machine Learning and Computer Vision Engineer - Technical Interview Questions
Data science interview questions and answers
A Collection of Cheatsheets, Books, Questions, and Portfolio For DS/ML Interview Prep
Data science interview questions with answers. Not ideally (yet)