- @DayaneMarcos_AI
Stars
A generative world for general-purpose robotics & embodied AI learning.
Graph Machine Learning course, Xavier Bresson, 2023
A curated list of practical financial machine learning tools and applications.
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
A Chainlit App Used to Showcase: Async, Caching, Additional Chainlit Methods, and more!
Dynamic Metadata based RAG Framework
An open-source RAG-based tool for chatting with your documents.
Game Development Patterns with Unreal Engine 5, published by Packt
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
[NeurIPS'24 Spotlight, ICLR'25, ICML'25] To speed up long-context LLMs' inference, approximate and dynamic sparse attention computation reduces inference latency by up to 10x for pre-filli…
Cleaned-up Dark Souls AI scripts that provide a better starting point for modding.
Universal LLM Deployment Engine with ML Compilation
Granite Code Models: A Family of Open Foundation Models for Code Intelligence
The Open Source Memory Layer For Autonomous Agents
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
The official implementation of RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…
A high-throughput and memory-efficient inference and serving engine for LLMs
Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.
REST: Retrieval-Based Speculative Decoding, NAACL 2024
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
Letta (formerly MemGPT) is the stateful agents framework with memory, reasoning, and context management.
Official implementation of "DoRA: Weight-Decomposed Low-Rank Adaptation"
Stanford NLP Python library for Representation Finetuning (ReFT)
RAFT, or Retrieval-Augmented Fine-Tuning, is a method comprising a fine-tuning phase and a RAG-based retrieval phase. It is particularly suited for the creation of agents that realistically emulate a …
Chat Templates for 🤗 HuggingFace Large Language Models
tiktoken is a fast BPE tokeniser for use with OpenAI's models.