-
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
Python Other UpdatedMay 27, 2025 -
gaudi-pytorch-bridge Public
Forked from HabanaAI/gaudi-pytorch-bridgeC++ Apache License 2.0 UpdatedMay 26, 2025 -
sglang Public
Forked from sgl-project/sglangSGLang is a fast serving framework for large language models and vision language models.
Python Apache License 2.0 UpdatedFeb 24, 2025 -
-
-
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedNov 22, 2024 -
flashinfer Public
Forked from flashinfer-ai/flashinferFlashInfer: Kernel Library for LLM Serving
Cuda Apache License 2.0 UpdatedNov 21, 2024 -
-
transformers Public
My take on transformers implementation with help from various sources publicly available in the internet
Jupyter Notebook UpdatedJul 23, 2024 -
rag_demo Public
Code to illustrate implementation of RAG using Langchain
-
optimum-habana Public
Forked from huggingface/optimum-habanaEasy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)
Python Apache License 2.0 UpdatedJan 31, 2024 -
-
-