8000 Daniel-NJ / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View Daniel-NJ's full-sized avatar

Block or report Daniel-NJ

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Make RSS 📰 great again with AI 🧠✨!!

Go 894 57 Updated May 19, 2025

一款提示词优化器,助力于编写高质量的提示词

TypeScript 5,067 640 Updated May 21, 2025

Measures the latency between CPU cores

Jupyter Notebook 1,217 88 Updated Aug 13, 2024

A community-oriented list of useful NUMA-related libraries, tools, and other resources

69 11 Updated Sep 28, 2020

Dynamic Instrumentation Tool Platform

C 2,826 578 Updated May 21, 2025

Loop Kernel Analysis and Performance Modeling Toolkit

Jupyter Notebook 93 23 Updated Mar 19, 2025

Parallel solvers for sparse linear systems featuring multigrid methods.

C 750 207 Updated May 23, 2025

Material for gpu-mode lectures

Jupyter Notebook 4,475 450 Updated Feb 9, 2025

[MLSys'25] QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving; [MLSys'25] LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

C++ 682 45 Updated Mar 6, 2025

人人都能用英语

TypeScript 26,198 3,887 Updated Apr 13, 2025

Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation

1,549 165 Updated Apr 13, 2025

A tutorial on RDMA based programming using code examples

C 549 151 Updated Jan 3, 2020

Large Language Model Text Generation Inference

Python 10,149 1,193 Updated May 23, 2025

Universal LLM Deployment Engine with ML Compilation

Python 20,662 1,735 Updated May 1, 2025

High Performance Linpack for Next-Generation AMD HPC Accelerators

C++ 53 23 Updated May 21, 2025

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 9,243 1,584 Updated May 23, 2025

Port of OpenAI's Whisper model in C/C++

C++ 40,170 4,260 Updated May 23, 2025

LLM inference in C/C++

C++ 80,741 11,878 Updated May 23, 2025

Scheduler for sub-node tasks for HPC systems with batch scheduling

Rust 413 31 Updated May 23, 2025

Repository contains scripts to run Graph500 benchmark on Salomon cluster

Shell 1 Updated Dec 11, 2018

The simplest way to run LLaMA on your local machine

CSS 13,078 1,397 Updated Jun 18, 2024

High accuracy RAG for answering questions from scientific documents with citations

Python 7,369 725 Updated May 21, 2025

Zstandard - Fast real-time compression algorithm

C 24,958 2,243 Updated May 20, 2025

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

C++ 73,381 8,015 Updated Mar 19, 2025

GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as…

C++ 1,333 555 Updated Feb 15, 2025

This is an online course where you can learn and master the skill of low-level performance analysis and tuning.

C++ 3,083 292 Updated May 19, 2025

The book "Performance Analysis and Tuning on Modern CPU"

TeX 3,134 212 Updated Feb 20, 2025

A benchmark for low-level CPU micro-architectural features

C++ 723 64 Updated Feb 8, 2022

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 108,002 17,583 Updated May 22, 2025
Next
0