Lists (32)
Sort Name ascending (A-Z)
3D-Synthesis
AIGC-Video
Audio
Data & Data Process
DataLocation
Locate data on map... to lon/latDepth
Detection+Segmentation
Distill
Embody AI
Face
Fake/Defake
GAN
Hardware/Accelerate
Image Quality
图像/视频质量评估KG
KnowledgeGraph
LLM
LLM-Agent
LLM-Code
Multi-Modal
OSINT
Pretraining-CV
ReinforceLearning
StableDiffusion
TimeSeries
Tracking
Translation
Video
VLM
机器学习工具及平台
重要信息汇集
非机器学习工具及平台
Starred repositories
An open-source AI agent that brings the power of Gemini directly into your terminal.
Demo of a customer service use case implemented with the OpenAI Agents SDK
The Cursor for Designers • An Open-Source Visual Vibecoding Editor • Visually build, style, and edit your React App with AI
A set of tools that gives agents powerful capabilities.
Integrate cutting-edge LLM technology quickly and easily into your apps
Adding guardrails to large language models.
LLM-based ontological extraction tools, including SPIRES
A model-driven approach to building AI agents in just a few lines of code.
Jan is an open source alternative to ChatGPT that runs 100% offline on your computer
A curated list of awesome commands, files, and workflows for Claude Code
About This repository is a curated collection of the most exciting and influential CVPR 2025 papers. 🔥 [Paper + Code + Demo]
Time-R1: Framework and resources for endowing LLMs with comprehensive temporal reasoning (understanding, prediction, creative generation) using a novel three-stage RL curriculum. Includes the Time-…
Get started with building Fullstack Agents using Gemini 2.5 and LangGraph
A lightweight LMM-based Document Parsing Model
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
🌐 WebWalker [ACL2025] & WebDancer [Preprint]
Automatically crawl arXiv papers daily and summarize them using AI. Illustrating them using GitHub Pages.
11 Lessons to Get Started Building AI Agents
This is a simple demonstration of more advanced, agentic patterns built on top of the Realtime API.
Official Repository: A Comprehensive Benchmark for Logical Reasoning in MLLMs
Official repository for "Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment"
Code for the paper: "Learning to Reason without External Rewards"
An agentic company research tool powered by LangGraph and Tavily that conducts deep diligence on companies using a multi-agent framework. It leverages Google's Gemini 2.0 Flash and OpenAI's GPT-4.1…
直播带货工具,支持抖音小店、巨量百应、抖音团购、小红书千帆、视频号、快手小店平台,能自动弹窗,自动发言,AI助力回复
Curated list of awesome Cursor Rules .mdc files
🚀 Next.js + Tailwind CSS + TypeScript dashboard template with auth, i18n, 8 pages, 4 themes and 14 data charts
Train your Agent model via our easy and efficient framework
Learn Low Level Design (LLD) and prepare for interviews using free resources.
Learn System Design concepts and prepare for interviews using free resources.