Lists (1)
Sort Name ascending (A-Z)
Stars
A c++ trainable semantic segmentation library based on libtorch (pytorch c++). Backbone: VGG, ResNet, ResNext. Architecture: FPN, U-Net, PAN, LinkNet, PSPNet, DeepLab-V3, DeepLab-V3+ by now.
ResNet Implementation, Training, and Inference Using LibTorch C++ API
deep learning for image processing including classification and object-detection etc.
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
Web based picture annotator - can be used define areas within a picture that contain a title, a description and a link.
Vue Component for drawing annotation (rect, polygon, etc) using SVG element
Effortless data labeling with AI support from Segment Anything and other awesome models.
✏️ Web-based image segmentation tool for object detection, localization, and keypoints
Integrate deep learning models for image classification | Backbone learning/comparison/magic modification project
Paper list and datasets for industrial image anomaly/defect detection (updating). 工业异常/瑕疵检测论文及数据集检索库(持续更新)。
An anomaly detection library comprising state-of-the-art algorithms and features such as experiment management, hyper-parameter optimization, and edge inference.
图片标注网站,用fabric.js基于 canvas对图片进行标注,包括图形矩形、圆形、多边形、直线、线段、点标记车道线,障碍物,交通信号灯等
OCR, layout analysis, reading order, table recognition in 90+ languages
C++ and Python implementations of YOLOv5, YOLOv6, YOLOv7, YOLOv8, YOLOv9, YOLOv10, YOLOv11 inference.
Labeling tool with SAM(segment anything model),supports SAM, SAM2, sam-hq, MobileSAM EdgeSAM etc.交互式半自动图像标注工具
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Go cross-platform library for sending desktop notifications, alerts and beeps
Real-time video and audio processing on Streamlit
YOLOv8 model for detection hard hats on people