- Venezuela
- @PastorSotoB1
- in/pastor-soto-34215b125
Starred repositories
This is a repo with links to everything you'd ever want to learn about data engineering
FULL v0, Cursor, Manus, Same.dev, Lovable, Devin, Replit Agent, Windsurf Agent, VSCode Agent, Dia Browser, Trae AI & Cluely (And other Open Sourced) System Prompts, Tools & AI Models.
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
FastAPI framework, high performance, easy to learn, fast to code, ready for production
A model context protocol server that connects to Anki through AnkiConnect
Fetch message history from Discord for LLMs
Data engineering study group for the Data Engineering specialization by DeepLearning.AI
A curated list of free courses with certifications. Also available at https://free-certifications.com/
TWIX is an open-source data extraction tool that reconstructs structured data from documents at scale, accurately and at low cost, by inferring the shared underlying visual template across documents
Tool for generating high-quality synthetic datasets
A system for agentic LLM-powered data processing and ETL
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning
Python tool for converting files and office documents to Markdown.
In-depth tutorials on LLMs, RAGs and real-world AI agent applications.
Code for the book Data Science From Scratch
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…
A Kubernetes deployable instance of GroundX for document parsing, storage, and search.
[WIP] Resources for AI engineers. Also contains supporting materials for the book AI Engineering (Chip Huyen, 2025)
Master programming by recreating your favorite technologies from scratch.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
The Self-hosted AI Starter Kit is an open-source template that quickly sets up a local AI environment. Curated by n8n, it provides essential tools for creating secure, self-hosted AI workflows.
Friends of Tracking: Tutorial on valuing actions in football.
Convert soccer event stream data to SPADL and value player actions using VAEP or xT
Build a machine learning curriculum that actually matches job requirements