Lists (2)
Sort Name ascending (A-Z)
Starred repositories
Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.
Official style files for papers submitted to venues of the Association for Computational Linguistics
🔥CVPR 2025 Multimodal Large Language Models Paper List
Pipelines: Versatile, UI-Agnostic OpenAI-Compatible Plugin Framework
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
A browser extension that helps users publish content to multiple social media platforms with one click.
User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)
🤯 Lobe Chat - an open-source, modern design AI chat framework. Supports multiple AI providers (OpenAI / Claude 4 / Gemini / DeepSeek / Ollama / Qwen), Knowledge Base (file upload / knowledge manage…
✨First Open-Source R1-like Video-LLM [2025/02/18]
⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~
🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition.
OpenOCR: A general OCR system with accuracy and efficiency. Supporting 24 Scene Text Recognition methods trained from scratch on large-scale real datasets, and will continue to add the latest methods.
Python tool for converting files and office documents to Markdown.
End-to-end stack for WebRTC. SFU media server and SDKs.
Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
Integrate the DeepSeek API into popular softwares
yang123me / ShellClash
Forked from juewuy/ShellCrash在Linux环境下使用Shell脚本一键部署及管理Clash服务