- NetEase
- China
- https://lpcong.blog.csdn.net/
Stars
The Triton TensorRT-LLM Backend
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…
A series of large language models developed by Baichuan Intelligent Technology
Fast and memory-efficient exact attention
Large Language Model Text Generation Inference
A high-throughput and memory-efficient inference and serving engine for LLMs
Home of StarCoder: fine-tuning & inference!
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
🦜🔗 Build context-aware reasoning applications
A command-line productivity tool powered by AI large language models like GPT-4 that helps you accomplish your tasks faster and more efficiently.
Hacky repo to see what the Copilot extension sends to the server
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
A library for efficient similarity search and clustering of dense vectors.
Easily compute clip embeddings and build a clip retrieval system with them
A Vue 3 (and 2) reliable, simple and touch-ready panes splitter / resizer.
Probabilistic language based on pattern matching and constraint propagation, 153 examples
A server software reimplementation for a certain anime game.
How to use pycurl with gevent. Source: https://bitbucket.org/denis/gevent-curl/src/default/geventcurl.py
A light-weight, no-dependency, vanilla JavaScript engine to drive user's focus across the page