-
Independent Researcher
- beijing
- https://dw-yejing.github.io
Lists (1)
Sort Name ascending (A-Z)
Starred repositories
A markdown parser and compiler. Built for speed.
For developers, who are building real-time data-driven applications, Redis is the preferred, fastest, and most feature-rich cache, data structure server, and document and vector query engine.
Online playground for OpenAPI tokenizers
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
Simple, safe way to store and distribute tensors
FULL v0, Cursor, Manus, Same.dev, Lovable, Devin, Replit Agent, Windsurf Agent, VSCode Agent, Dia Browser & Trae AI (And other Open Sourced) System Prompts, Tools & AI Models.
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
PowerPoint-ist(/'pauəpɔintist/), An online presentation application that replicates most of the commonly used features of MS PowerPoint, allowing for the editing and presentation of PPT online. Sup…
Streamlit — A faster way to build and share data apps.
Netease Youdao's open-source embedding and reranker models for RAG products.
Question and Answer based on Anything.
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
the AI-native open-source embedding database
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …
2021年最新总结,推荐工程师合适读本,计算机科学,软件技术,创业,思想类,数学类,人物传记书籍
a small, expressive orm -- supports postgresql, mysql, sqlite and cockroachdb
Git with a cup of tea! Painless self-hosted all-in-one software development service, including Git hosting, code review, team collaboration, package registry and CI/CD
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
Typer, build great CLIs. Easy to code. Based on Python type hints.
This repo provides the server side code for llmsherpa API to connect. It includes parsers for various file formats.
This is an implementation of the paper: Searching for Best Practices in Retrieval-Augmented Generation (EMNLP2024)
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…