- Guangzhou, China
Lists (13)
Sort Name ascending (A-Z)
Stars
Faker is a Python package that generates fake data for you.
🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN
AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
An implementation of iterative deep research using the OpenAI Agents SDK
verl: Volcano Engine Reinforcement Learning for LLMs
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
Sky-T1: Train your own O1 preview model within $450
Fully local web research and report writing assistant
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
Fully open reproduction of DeepSeek-R1
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
tiktoken is a fast BPE tokeniser for use with OpenAI's models.
Python3 package for Chinese/English OCR,use paddleocr-v5 onnx model(~20MB), with ultra-fast inference speed. 基于ppocr-v5-onnx模型推理,中英文OCR开源SOTA,推理速度超快。
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Python tool for converting files and office documents to Markdown.
Text classification repository built with Torch, featuring training tricks, acceleration methods, and model optimization techniques like distillation, compression, and pruning. Supports single-labe…
心理健康大模型 (LLM x Mental Health), Pre & Post-training & Dataset & Evaluation & Depoly & RAG, with InternLM / Qwen / Baichuan / DeepSeek / Mixtral / LLama / GLM series models
Start building LLM-empowered multi-agent applications in an easier way.
[GenAI Application Development Framework] 🚀 Build GenAI application quick and easy 💬 Easy to interact with GenAI agent in code using structure data and chained-calls syntax 🧩 Use Agently Workflow t…
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Streamlit — A faster way to build and share data apps.
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.
一个用于训练句子embedding的工具,支持Cosent以及Simcse、infonce