8000 SimonZhao777 (Warren Zhao) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View SimonZhao777's full-sized avatar
  • Ant Group
  • Shanghai

Block or report SimonZhao777

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

[NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models

Python 364 23 Updated Sep 6, 2024

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

Python 6,847 951 Updated Jul 3, 2024

An AI Hedge Fund Team

Python 28,293 4,904 Updated May 18, 2025

Linux multi-touch gesture recognizer

C++ 3,838 174 Updated Dec 1, 2024

Label Studio is a multi-type data labeling and annotation tool with standardized output format

JavaScript 22,107 2,741 Updated May 18, 2025

Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point and image-level flag annotation).

Python 14,467 3,498 Updated May 13, 2025

LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source …

Python 23,717 6,440 Updated Jun 7, 2024

A curated list of graph-based fraud, anomaly, and outlier detection papers & resources

1,567 267 Updated Apr 26, 2025

🤗更优雅的微信公众号订阅方式,支持私有化部署、微信公众号RSS生成(基于微信读书)

TypeScript 7,158 1,231 Updated Apr 4, 2025

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

Python 4,975 545 Updated May 15, 2025

best way to save what you love

Svelte 32,024 2,673 Updated May 17, 2025

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

TypeScript 38,452 3,504 Updated May 18, 2025

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

Python 28,930 1,968 Updated Apr 28, 2025

Model Context Protocol Servers

JavaScript 47,109 5,313 Updated May 18, 2025

High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.

Python 9,746 834 Updated May 14, 2025

💯 Curated coding interview preparation materials for busy software engineers

TypeScript 126,243 15,396 Updated Apr 28, 2025

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.

JavaScript 44,185 4,324 Updated May 16, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 95,162 12,218 Updated May 18, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 49,111 5,976 Updated May 16, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…

Python 7,606 644 Updated May 18, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.

Go 140,895 11,784 Updated May 18, 2025

A toolkit for blockchain data collection

Python 144 28 Updated Mar 12, 2025

Awesome-RAG: Collect typical RAG papers and systems.

371 28 Updated Jan 23, 2025

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 7,919 488 Updated May 18, 2025

A curated collection of public industrial datasets.

HTML 156 23 Updated Apr 16, 2025

A collection of open datasets for industrial applications, divided by categories

80 11 Updated Apr 8, 2022

A topic-centric list of HQ open datasets.

63,145 10,117 Updated Nov 13, 2024

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 37,824 4,476 Updated Aug 19, 2024
Next
0