Starred repositories
This project hosts security advisories and their accompanying proof-of-concepts related to research conducted at Google which impact non-Google owned code.
A high-throughput and memory-efficient inference and serving engine for LLMs
Power management, monitoring and VirtualSMC plugin for AMD processors
RAPL power capping C interface with multiple implementations
Tools for experimenting with Running Average Power Limit (RAPL)
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
A curated list of open-source projects related to DeepSeek Coder
DeepSeek Coder: Let the Code Write Itself
[NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation
Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications built on Langchain and language models such as ChatGLM, Qwen, and Llama | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…
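Llama Chinese community: a real-time roundup of the latest Llama learning resources, building the best open-source ecosystem for Chinese Llama large models; fully open source and available for commercial use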
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
The VMware Architecture Migration Tool (VAMT) is designed to provide an easy and automated process to cold migrate machines between clusters of different architecture types within the same vCenter …
A Python package that extends the official PyTorch to easily obtain performance gains on Intel platforms
AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and versatility of software and hardware.
Reference implementations of MLPerf™ training benchmarks
Awesome-LLM-Benchmark: List of benchmarks for Large-Language Models
A collection of benchmarks and datasets for evaluating LLMs.
Rcmp: Reconstructing RDMA-based Memory Disaggregation via CXL
A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM
Caliptra IP and firmware for integrated Root of Trust block