8000 partht92 / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View partht92's full-sized avatar

Highlights

  • Pro

Block or report partht92

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Jupyter Notebook 12,958 1,509 Updated Jun 4, 2025

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 108,962 17,732 Updated Jun 6, 2025
< 8000 div class="d-inline-block mb-1">

S-LoRA / S-LoRA

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Python 1,830 109 Updated Jan 21, 2024

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 304,532 50,378 Updated May 21, 2025

JA3 is a standard for creating SSL client fingerprints in an easy to produce and shareable way.

Python 2,945 302 Updated May 1, 2025

LLM inference in C/C++

C++ 81,428 12,026 Updated Jun 6, 2025

LLM powered retrieval engine designed to process a ton of sources to collect a comprehensive list of entities.

TypeScript 495 53 Updated May 7, 2024

Opiniated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …

Python 37,957 3,641 Updated Jun 5, 2025

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Jupyter Notebook 6,362 1,142 Updated May 22, 2025

Annotated version of the Mamba paper

Jupyter Notebook 484 18 Updated Feb 27, 2024

Mamba SSM architecture

Python 15,030 1,326 Updated May 25, 2025

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Python 2,183 275 Updated May 11, 2025

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 16+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

Python 8,211 664 Updated Jun 6, 2025

Large Language Model Text Generation Inference

Python 10,189 1,194 Updated Jun 6, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 22,727 2,505 Updated Aug 12, 2024
Python 8,626 507 Updated Oct 9, 2024

MLX: An array framework for Apple silicon

C++ 20,835 1,223 Upda 8000 ted Jun 6, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 49,051 7,822 Updated Jun 6, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 41,699 6,949 Updated Dec 9, 2024

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Python 2,995 216 Updated May 21, 2025

FastAPI framework, high performance, easy to learn, fast to code, ready for production

Python 85,965 7,450 Updated Jun 6, 2025

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 38,452 2,935 Updated Jun 6, 2025

Raising the Cost of Malicious AI-Powered Image Editing

Jupyter Notebook 600 47 Updated Feb 27, 2023

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

54,585 5,822 Updated Jun 4, 2025

Tacotron 2 - PyTorch implementation with faster-than-realtime inference

Jupyter Notebook 5,231 1,417 Updated Jun 12, 2024

A collaboration friendly studio for NeRFs

Python 10,283 1,434 Updated May 21, 2025

A Unified Framework for Surface Reconstruction

Python 2,047 194 Updated Jul 11, 2024

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 22,037 2,854 Updated Aug 15, 2024
Next
0