Stars
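Ultra-lightweight Chinese OCR with support for vertical text recognition and for ncnn, mnn, and tnn inference (dbnet (1.8M) + crnn (2.5M) + anglenet (378KB)); the total model size is only 4.7M
Container number detection and recognition.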
CnOCR: Awesome Chinese/English OCR Python toolkit based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. 【Based on PyTor…
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, and Cyrillic.
Proceed with text detection only in the selected area of the image
A Conversational Speech Generation Model
🌐 Make websites accessible for AI agents. Automate tasks online with ease.
qosmio / openwrt-ipq
Forked from openwrt/openwrt
NSS Fork of OpenWrt targeting Qualcomm IPQ807x/6018
No fortress, purely open ground. OpenManus is Coming.
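A standard software center for OpenWrt, implemented purely in scripts and depending only on standard OpenWrt components. Other firmware developers can integrate it into their own firmware, and it makes searching for and installing plugins easier for beginners. The iStore is an app store for OpenWRT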
Solve Visual Understanding with Reinforced VLMs
Joplin - the privacy-focused note taking app with sync capabilities for Windows, macOS, Linux, Android and iOS.
Collection of publicly available IPTV channels from all over the world
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Fast and accurate automatic speech recognition (ASR) for edge devices
Instant voice cloning by MIT and MyShell. Audio foundation model.
Chinese-language webui that integrates OpenVoice and Melotts, with resemble_enhance audio enhancement added
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
Let your Claude think
Multilingual Voice Understanding Model
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Sample code for world-class Artificial Intelligence SoCs for computer vision applications.
Image Restoration Toolbox (PyTorch). Training and testing codes for DPIR, USRNet, DnCNN, FFDNet, SRMD, DPSR, BSRGAN, SwinIR
Virtual whiteboard for sketching hand-drawn like diagrams
Practical Blind Denoising via Swin-Conv-UNet and Data Synthesis (Machine Intelligence Research 2023)
KAIST Multispectral Pedestrian Detection Benchmark [CVPR '15]