Stars
Official inference repo for FLUX.1 models
中文文档理解多模态语言模型,支持多模态文档信息抽取,文档embedding
This repo provides Geometric LayoutLM for Vietnamese document and code for export to ONNX
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.
ACM Multimedia 2023: DocDiff: Document Enhancement via Residual Diffusion Models. Also contains 1597 red seals in Chinese scenes, along with their corresponding binary masks.
Examples and guides for using the OpenAI API
A collection of libraries to optimise AI model performances
Final Project from Artificial Neural Networks and Deep Learning
Text-to-Image generation. The repo for NeurIPS 2021 paper "CogView: Mastering Text-to-Image Generation via Transformers".
DALL·E Mini - Generate images from a text prompt
Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.
High-Resolution Image Synthesis with Latent Diffusion Models
🙈程序员找工作黑名单,换工作和当技术合伙人需谨慎啊 更新有赞
Neural Network Blocks - Collect all kinds of fancy model blocks for you to build more powerful neural network model.
An Open Source Machine Learning Framework for Everyone
Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image …
ZJU-lishuang / yolov5
Forked from ultralytics/yolov5YOLOv5 in PyTorch > ONNX > CoreML > iOS
yolov5 prune,Support V2, V3, V4 and V6 versions of yolov5
mobilev2-yolov5s剪枝、蒸馏,支持ncnn,tensorRT部署。ultra-light but better performence!