8000 cubist38 (Gia Huy Vuong) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View cubist38's full-sized avatar

Block or report cubist38

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Tensor library for machine learning

C++ 12,588 1,244 Updated May 25, 2025

LiteRT is the new name for TensorFlow Lite (TFLite). While the name is new, it's still the same trusted, high-performance runtime for on-device AI, now with an expanded vision.

C++ 443 59 Updated May 25, 2025

Supporting PyTorch models with the Google AI Edge TFLite runtime.

Jupyter Notebook 593 76 Updated May 23, 2025

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 108,145 17,610 Updated May 24, 2025

Run LLMs with MLX

Python 833 105 Updated May 25, 2025

LLM inference in C/C++

C++ 80,829 11,899 Updated May 25, 2025

Port of OpenAI's Whisper model in C/C++

C++ 40,213 4,271 Updated May 23, 2025

Chat language model that can use tools and interpret the results

Python 1,553 117 Updated May 6, 2025

c/ua is the Docker Container for Computer-Use AI Agents.

Python 7,924 325 Updated May 25, 2025

Forked from ggerganov/llama.cpp

C++ 12 5 Updated May 25, 2025

MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.

Python 1,290 127 Updated May 24, 2025

A high-performance API server that provides OpenAI-compatible endpoints for MLX models. Developed using Python and powered by the FastAPI framework, it provides an efficient, scalable, and user-fri…

Python 42 13 Updated May 24, 2025

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

TypeScript 38,818 3,555 Updated May 25, 2025

mactop - Apple Silicon Monitor Top

Go 1,918 44 Updated Jan 11, 2025

Wan2.1 for Mac.

Python 31 2 Updated May 20, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Python 5,372 601 Updated May 20, 2025

FlashMLA: Efficient MLA decoding kernels

Cuda 11,571 836 Updated Apr 29, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 14,617 1,836 Updated May 25, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 48,011 7,585 Updated May 25, 2025

MLX: An array framework for Apple silicon

C++ 20,712 1,213 Updated May 23, 2025

The official Meta Llama 3 GitHub site

Python 28,728 3,388 Updated Jan 26, 2025

Grok open release

Python 50,287 8,357 Updated Aug 30, 2024
JavaScript 2 1 Updated Jan 11, 2023
C++ 1 Updated Jul 6, 2022
C++ 2 Updated Jul 6, 2022

This repository keeps track of everything including all modules and projects that I do during my internship at VNG

Java 3 Updated Jan 11, 2023

LeNet-5 cpp implementation

C++ 1 Updated Jan 11, 2024

A toolbox for Vietnamese Optical Character Recognition.

C++ 116 32 Updated Oct 11, 2022
Next
0