HCMUS - Ho Chi Minh
Stars
LiteRT is the new name for TensorFlow Lite (TFLite). While the name is new, it's still the same trusted, high-performance runtime for on-device AI, now with an expanded vision.
Supporting PyTorch models with the Google AI Edge TFLite runtime.
🦜🔗 Build context-aware reasoning applications
Port of OpenAI's Whisper model in C/C++
Chat language model that can use tools and interpret the results
c/ua is the Docker Container for Computer-Use AI Agents.
ngxson / llama.cpp
Forked from ggml-org/llama.cpp
MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.
A high-performance API server that provides OpenAI-compatible endpoints for MLX models. Developed in Python and powered by the FastAPI framework, it offers an efficient, scalable, and user-fri…
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
FlashMLA: Efficient MLA decoding kernels
SGLang is a fast serving framework for large language models and vision language models.
A high-throughput and memory-efficient inference and serving engine for LLMs
This repository tracks all modules and projects from my internship at VNG.
A toolbox for Vietnamese Optical Character Recognition.